PREFACE


This volume is part of a 16-volume set that summarizes the research accomplishments of
faculty, graduate student, and high school participants in the 1992 Air Force Office of Scientific
Research (AFOSR) Summer Research Program. The current volume, Volume 7 of 16, presents
the final research reports of graduate student (GSRP) participants at Armstrong Laboratory.

Reports  presented  herein  are  arranged  alphabetically  by  author  and  are  numbered 
consecutively  --  e.g.,  1-1,  1-2,  1-3;  2-1,  2-2,  2-3. 

Research  reports  in  the  16-volume  set  are  organized  as  follows: 

VOLUME  TITLE 

1  Program  Management  Report 

2  Summer  Faculty  Research  Program  Reports:  Armstrong  Laboratory 

3  Summer  Faculty  Research  Program  Reports:  Phillips  Laboratory 

4  Summer  Faculty  Research  Program  Reports:  Rome  Laboratory 

5A Summer Faculty Research Program Reports: Wright Laboratory (part one)

5B Summer Faculty Research Program Reports: Wright Laboratory (part two)

6  Summer  Faculty  Research  Program  Reports:  Arnold  Engineering  Development  Center;  Civil 
Engineering  Laboratory;  Frank  J.  Seiler  Research  Laboratory;  Wilford  Hall  Medical  Center 

7  Graduate  Student  Research  Program  Reports:  Armstrong  Laboratory 

8  Graduate  Student  Research  Program  Reports:  Phillips  Laboratory 

9  Graduate  Student  Research  Program  Reports:  Rome  Laboratory 

10  Graduate  Student  Research  Program  Reports:  Wright  Laboratory 

11  Graduate  Student  Research  Program  Reports:  Arnold  Engineering  Development  Center;  Civil 
Engineering  Laboratory;  Frank  J.  Seiler  Research  Laboratory;  Wilford  Hall  Medical  Center 

12  High  School  Apprenticeship  Program  Reports:  Armstrong  Laboratory 

13  High  School  Apprenticeship  Program  Reports:  Phillips  Laboratory 

14  High  School  Apprenticeship  Program  Reports:  Rome  Laboratory 

15  High  School  Apprenticeship  Program  Reports:  Wright  Laboratory 

16 High School Apprenticeship Program Reports: Arnold Engineering Development Center; Civil
Engineering Laboratory
FEATURE CONJUNCTION SEARCH IN THREE-DIMENSIONAL SPACE

Jennifer  L.  Blume 
Department  of  Psychology 
Texas  Tech  University 

Abstract

Assumptions posed by Previc's model (1990)
that the upper visual field (UVF) and lower visual
field (LVF) are specialized for far and near
vision, respectively, were investigated. A further
assumption that the left brain hemisphere, and thus
the right visual field, as well as the UVF are
characterized by focal processing was also tested.
Detection speed and accuracy for random-dot
stereogram targets in a feature conjunction task
were measured as a function of target position
(lateral half-field, vertical half-field,
eccentricity, and depth). Targets appearing in the
UVF and the right half-field resulted in faster
responding than targets in the LVF or left half-field.
The results are discussed in support of
Previc's theoretical model of visual processing.
FEATURE CONJUNCTION SEARCH IN THREE-DIMENSIONAL SPACE

Jennifer  L.  Blume 

INTRODUCTION 

A recent theoretical paper (Previc 1990)
outlined a model of visual processing in
three-dimensional space which proposes functional
asymmetries of the vertical and lateral visual
half-fields.

Along the vertical axis, the model defines a
distinction between the functions and characteristics
of the upper visual field (UVF) and the
lower visual field (LVF). Specifically, Previc
proposes that the UVF is specialized for far visual
space and utilizes a focal mode of processing while
the LVF is specialized for near space and is
characterized by global processing. One line of
evidence supporting these relationships is that as
the eyes move up into the UVF, or down into the
LVF, they reflexively diverge or converge, corresponding
to the eye movements necessary when
viewing far or near objects (Heuer, 1988). The
relationship of far vision to the UVF also is
consistent with the fact that as objects move
further in depth, they move higher in the visual
field (i.e., the monocular pictorial depth cue,
"height in field").

Functional distinctions between the left and
right visual fields seem to be due to brain functioning.
Several studies have shown that the left
hemisphere, and thus the right visual field, seems
to be specialized for visual search, object and
shape detection, and object recognition (Hellige
1976, Polich 1982, & Yund, 1990). Some evidence
has also shown that global, holistic processing is
a right hemispheric function while focal processing
is a left hemispheric function (Delis, 1986). The
model's propositions regarding left/right asymmetries
define the right visual field as being
specialized for focal (far) processing and the left
visual field for global (near) processing (Previc
1990).

The ecological relevance of these asymmetries
is evidenced by several observations. Most visuomotor
activity occurring in near space (and the
LVF) is optically degraded due to movement, extreme
diplopia (causing blurred vision), and misaccommodation,
which in turn would necessitate a non-focal,
global processing of such information (Previc
1990). Conversely, objects in far vision (and the
UVF) tend to be smaller, clearer, and slower,
allowing focal processing, which has been shown to
be necessary for object recognition (Treisman &
Schmidt, 1982).

Visual search has been defined by some
researchers as involving two separate processes.
Treisman (1982) and Treisman & Gelade (1980) have
proposed that searching for a target that differs
from background distractors by one dimension or
feature (a square among circles) is a parallel
process. Yet, when the target differs by a conjunction
of two features (a small square among small
circles and large squares), the search is a serial
process involving focal processes (Treisman, Sykes,
& Gelade 1977).

Assuming that feature conjunction tasks do
involve focal attention and that focal attention is
a characteristic of the UVF (the far visual system)
and the right visual field (the left hemisphere),
Previc's model would predict that feature conjunction
search is best performed in the upper right
quadrant and poorest in the lower left
quadrant.

In the present study, speed and accuracy of
detection, recognition, and localization in a
feature conjunction task were measured as a
function of depth, eccentricity, vertical visual
field, and lateral visual field. According to
Previc's model, visual performance should be better
when the target appears in the UVF relative to the
LVF, and the right visual field relative to the
left visual field. Since Previc's model states
that the LVF is functionally specialized for near
vision and the UVF for far vision, we investigated
the relationship between visual field and depth of
target. The model predicted that performance
should be better for far targets presented in the
UVF and near targets presented in the LVF.
Further, performance is expected to decrease
as eccentricity increases.

METHOD 

Subjects 

Ten employees of the USAF Armstrong Laboratory
with a mean age of 25.4 years participated in the
study. All subjects had uncorrected or corrected
(with contact lenses only) vision of 20/30 or
better, normal stereoscopic depth perception as
measured by the Armed Forces Vision Testing
Apparatus, and balanced crossed and uncrossed
disparity. All subjects were also right handed,
right footed, and right-eye dominant as determined
by the Miles ABC test (Miles, 1929).

Apparatus  and  Stimuli 

The  stimuli  were  generated  on  a  Silicon 
Graphics  3130  Iris  computer  system  and  were 
presented on a Hitachi 60-Hz color video monitor.

To  collect  reaction  time  (RT)  data,  the  Iris 
computer  was  linked  to  a  Zenith  Datasystems  Z-248 
computer  which  provided  a  clock  with  a  1-ms 
resolution. 

Subjects viewed the display with their
forehead and chin positioned in a standard ophthalmological
support brace; eyes were centered
vertically on the screen. The stimuli were random dot
stereograms (RDS), which were created using
the red-green anaglyphic technique. A red filter
(Kodak Wratten #29) covered the right eye and a
green filter (Wratten #58) covered the left eye.
Response information was relayed through buttons on
two mice, a "location mouse" and a "search
mouse", resting on the table in front of the
subject.

RDS stimuli consisted of diamonds and squares,
each in two sizes. Each target was presented on one
third of the total trials at each of three depths:
a near depth (475-mm apparent distance), a coplanar
depth (500-mm distance to screen), and a far depth
(522-mm apparent distance). Virtual separation of near and
far targets from the fixation plane was 2.3-mm, or
19' arc disparity. This distance was kept constant
by sliding the CRT forward or backward according
to each subject's interpupillary distance. The areas of the
large and small coplanar stimuli were 10.34-mm2 and
5.17-mm2, respectively. To create equal projected
sizes of all targets in virtual 3-dimensional
space, the areas of the large and small near
targets were adjusted to 10.68-mm2 and 5.52-mm2,
and the large and small far targets to 10.00-mm2
and 4.83-mm2 (see Gulick & Lawson, 1976, chap. 10).

Tasks

Search Task

Each trial began with a fixation cross in the
center of the screen for 300-ms followed by a 200-ms
presentation of one of the target stimuli, which
was then followed by a search field. The search
field was divided into four quadrants by two lines
bisecting the screen horizontally and vertically,
present during the search phase only. Nine target
positions were defined in each quadrant for a total
of 36. One target and 35 distractors (the other
three shapes) were presented in three circular
patterns of increasing eccentricity. For each
quadrant, two shapes appeared at the closest eccentricity
(20-mm from the fixation point), three at
the middle (40-mm from the fixation point), and
four at the farthest eccentricity from the center
(60-mm from the fixation point). Each of the 4
targets (small square, small diamond, large square,
large diamond) was represented in each of the 3
eccentricities, the 3 depths, and the 4 quadrants
only once per block, giving a total of 144 trials
per block. Location of targets and distractors was
randomly determined throughout the block of trials.
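The counts in the paragraph above can be checked with a short sketch. It is only an illustration of the stated factorial structure, not the original stimulus program: the names `build_block` and `positions_in_field`, the dictionary layout, and the use of a seeded shuffle are assumptions introduced here.

```python
import itertools
import random

TARGETS = ["small square", "small diamond", "large square", "large diamond"]
ECCENTRICITIES_MM = [20, 40, 60]             # ring radii from the fixation point
DEPTHS = ["near", "coplanar", "far"]
QUADRANTS = ["upper left", "upper right", "lower left", "lower right"]
POSITIONS_PER_RING = {20: 2, 40: 3, 60: 4}   # positions per quadrant on each ring


def build_block(seed=None):
    """Return one shuffled block with every target x eccentricity x depth x quadrant cell once."""
    trials = [
        {"target": t, "eccentricity_mm": e, "depth": d, "quadrant": q}
        for t, e, d, q in itertools.product(TARGETS, ECCENTRICITIES_MM, DEPTHS, QUADRANTS)
    ]
    # Hypothetical randomization: the source only says trial order was random.
    random.Random(seed).shuffle(trials)
    return trials


def positions_in_field():
    """Total number of stimulus positions in the search field."""
    return len(QUADRANTS) * sum(POSITIONS_PER_RING.values())


block = build_block(seed=0)
print(len(block))            # 144 trials per block (4 x 3 x 3 x 4)
print(positions_in_field())  # 36 positions (4 quadrants x (2 + 3 + 4))
```

Enumerating the cells with `itertools.product` guarantees that each combination occurs exactly once before shuffling, which is the property the block description relies on.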

Each subject was first to search for the
target and respond as soon as possible when the
target was found by pressing the left button on the
"search" mouse with one hand. Upon key press, a
square divided into four quadrants appeared on the
screen. The subject was then to move a cursor to
the quadrant corresponding to the location where
the target appeared by moving the "location" mouse
and clicking the center button with the other hand.
The subject was allowed two seconds to make this
second response. If the target was not found after
three seconds, the program scored the trial as "no
response" and moved on to the next trial.

Discrimination

A discrimination task was also presented as a
control to ascertain whether asymmetries shown in
the search task were caused by discriminability
differences rather than by asymmetries in the
search process.

As in the search task, a target shape appeared
for 200-ms, but was followed by a 200-ms presentation
of only one target or distractor, which
randomly appeared at one of the 36 locations and
three depths. The subject was instructed to
determine whether the shape in the search field
appeared the same in size and shape as the target
previously presented. Responses were indicated
with the "search" mouse. On half of the trials,
the shape was the same as the target; on the other
half a distractor shape was presented.

Procedure 

Each subject participated in a one-hour
session on each of seven days. During the first five
sessions, the subjects completed the search task
(three blocks per session). The discrimination
task was completed during the last two sessions
(two blocks per session). Each block consisted of
144 trials and took approximately 15 minutes to
complete. Intertrial duration was self-paced.

All experiments were conducted in a dark laboratory.
Instructions for both of the tasks
emphasized both speed and accuracy.

In the first session, subjects were
familiarized with the search task with a gradual
exposure to an increasing number of distractors in
the search field. In the second session subjects
completed three blocks of trials with the full
thirty-six objects in the search field. Data were
gathered during the third session. The fourth and
fifth sessions were the same as the second and
third except the subjects switched response hands
to control for hand dominance effects either in
detection response or localization response.

In each of the last two sessions (the
discrimination task), subjects completed two blocks
of practice and two blocks of actual tests. Hand
of response was switched between the sixth and
seventh sessions.

For both the search and the discrimination
tasks, the order of response hand was counterbalanced
across subjects.

Design 

The experimental design included four main
factors: vertical half-fields (2), lateral half-fields
(2), depth (3), and eccentricity (3).

RESULTS 

Data were analyzed by means of a 5-factor
completely repeated measures analysis of variance
(ANOVA). Separate ANOVAs were performed for RT and
percent correct. Only RTs of correct responses were
analyzed.
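As a reading aid for the F ratios reported in this section, the degrees of freedom of a fully within-subject effect can be sketched as follows. This assumes the conventional univariate approach in which each effect is tested against its own effect-by-subject interaction; the source does not state the error terms used, and the function name `within_subject_df` is introduced here for illustration only.

```python
def within_subject_df(n_subjects, *levels):
    """Degrees of freedom for a within-subject effect (main effect or interaction).

    levels: number of levels of each factor involved in the effect.
    Returns (effect df, error df), assuming the error term is the
    effect-by-subject interaction of a completely repeated-measures design.
    """
    effect_df = 1
    for k in levels:
        effect_df *= (k - 1)
    error_df = effect_df * (n_subjects - 1)
    return effect_df, error_df


# Ten subjects, as in the Subjects section.
print(within_subject_df(10, 2))     # (1, 9)   two-level half-field factors
print(within_subject_df(10, 3))     # (2, 18)  depth or eccentricity
print(within_subject_df(10, 3, 3))  # (4, 36)  depth x eccentricity interaction
```

Under that assumption, the values agree with the degrees of freedom attached to the F ratios for the half-field, depth, eccentricity, and depth-by-eccentricity effects.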

Search  Task 
Reaction  Time 

For the search task RT data, all 4 main
effects were statistically significant. Results
indicated a main effect of vertical half-field
[F(1,9)=17.46; p<.0024], lateral half-field
[F(1,9)=5.09; p<.0506], depth [F(2,18)=5.81;
p<.0113], and eccentricity [F(2,18)=133.08;
p<.0001]. When targets were located in the UVF,
average RT (1109.67-ms) was significantly shorter
than for those in the LVF (1256.96-ms); and those in
the right visual field yielded faster RT (1155.32-ms)
than the left visual field (1211.31-ms). The
Duncan's Multiple Range post hoc test on the depth
manipulation revealed that the RTs for near
(1209.36-ms) and far (1227.00-ms) targets were not
significantly different, but the coplanar target
RTs (1113.60-ms) were significantly faster than
both the near and far targets. A post hoc test on
the eccentricity effect showed a difference between
all three levels. Targets appearing in the center,
the middle, and the outer eccentricities resulted
in average RTs of 924.80-ms, 1204.01-ms, and 1421.13-ms,
respectively.

A two-way interaction between depth and
eccentricity was also significant [F(4,36)=3.82;
p=.0110]. The near and far conditions both gave a
longer RT as eccentricity increased than the
coplanar conditions. The near targets presented on
the center eccentricity resulted in longer RT than
the far targets, but for targets presented on the
middle and outer eccentricities, the far targets
resulted in a slower response than the near
targets. This interaction, which shows a crossover
between the near and far targets, may simply
reiterate the lack of a significant difference
between the near and far targets.

Percent Correct

The accuracy data for the search task resulted
in three of the four main effects reaching significance.
Results indicated a main effect of
lateral half-field [F(1,9)=9.22; p<.0141], depth
[F(2,18)=16.8; p<.0001], and eccentricity
[F(2,18)=9.88; p<.0013]. Subjects were significantly
more accurate on targets presented in the
right visual field (91.296%) than the left
(89.097%). The lack of a significant difference in
accuracy between UVF and LVF [F(1,9)=1.24; p<.2942]
suggests that there is no speed/accuracy trade-off
involved in this kind of task. Post hoc tests on
eccentricity showed that accuracy for the targets
placed on the outer ring was significantly lower
(87.465%) than for the middle ring (90.694%) and
the inner ring (92.431%), which did not significantly
differ from each other. Analysis of the
depth manipulation showed the near stimuli
resulting in significantly lower accuracy (86.354%)
than the coplanar (92.743%) and the far targets
(91.493%), which were not significantly different
from each other.

Discrimination  Task 

Reaction Time

Results indicated one main effect of
eccentricity [F(2,18)=12.18; p<.0005]. Post hoc
tests showed that the outer ring was associated
with significantly slower RT (602.631-ms) than the
middle (588.651-ms) and inner ring (582.436-ms),
which were not significantly different from each
other.

Percent  correct 

Discrimination accuracy data gave no main
effects but did show one three-way interaction
between depth, vertical half-fields, and
eccentricity [F(4,36)=3.97; p<.0090]. This small
effect seems to be a chance result of little
interpretable meaning.

DISCUSSION 

These findings are consistent with Previc's
model of visual processing, indicating that the UVF
is specialized for visual search involving feature
conjunction (and thus focal processing), and the
right visual field is specialized for focal
processing. Specifically, results indicated that
feature conjunction targets, which
engage focal processing mechanisms, were detected
faster in the UVF than in the LVF, and in
the right visual field than in the left
visual field.

The depth manipulation did not give the
predicted results that near targets would be
detected faster and more accurately in the LVF and
far targets in the UVF, as there was not a
significant interaction between depth and vertical
half-field. But the coplanar depth did result
in significantly faster and more accurate
performance than the others. This may be due
to the initial presentation of the target appearing
at the coplanar depth, i.e. subjects were primed
for the coplanar depth, resulting in faster RT for
coplanar targets.

The effect of eccentricity was consistent with
predictions; specifically, responses were slower and less
accurate for targets presented farther away from
the fixation point than for those presented close
to the fixation point.

Previc's model predicts visual processing
asymmetries as resulting from processes involved in
visual search. The discrimination task's lack of
significant main effects of vertical or lateral
half-field indicates that differences
in visual field processing are in fact due to
effects of search rather than discriminability
processes.

In conclusion, these results support Previc's
model defining distinct functional asymmetries
of processing along the vertical axis (UVF and LVF)
and the lateral axis (right and left visual
fields).
ABSTRACT


The tools of fractal analysis have been used in an attempt to better understand the relationship
between dynamical systems analysis and cognitive task assessment. The main thrust of the research has been
to determine both the fractal and multifractal scaling behavior of an electroencephalogram (EEG), and to
determine if this scaling behavior is sensitive to tasks being carried out by a subject. Universal features of the
EEG have been examined by comparing the scaling behavior for 5 subjects, each of whom performed the same
tasks. The results were compared with known results for random noise, in an effort to determine the amount
of randomness in the signal. Upon determination of the details of the scaling behavior of the EEG, the
sensitivity of this behavior to cognitive task was examined by comparing EEG recorded under 12 different task
conditions. Four 30-second EEG segments taken from a single 3-minute record were examined, and the scaling
behavior for each segment was averaged to produce a single result for the given task. Scaling exponents and
the multifractal spectrum were computed for each subject under all task conditions. Although the determination
of the scaling exponents and the multifractal spectrum was successful, association of these characteristics with
particular tasks yielded ambiguous results. A great deal of the ambiguity stems from the lack of stationarity
of the signal, since the scaling behavior is not uniform from one section of the EEG to another.
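The per-segment averaging described in this abstract can be sketched in a few lines. The abstract does not say which estimator was used, so the increment-scaling estimator below (slope of log mean absolute increment against log lag) is a stand-in chosen for simplicity, and the names `scaling_exponent` and `averaged_exponent`, the lag set, and the synthetic ramp signal are all assumptions made for illustration.

```python
import math


def scaling_exponent(signal, lags=(1, 2, 4, 8, 16)):
    """Least-squares slope of log(mean |x[t+lag] - x[t]|) versus log(lag)."""
    xs, ys = [], []
    for lag in lags:
        diffs = [abs(signal[i + lag] - signal[i]) for i in range(len(signal) - lag)]
        mean_diff = sum(diffs) / len(diffs)
        if mean_diff > 0:  # log is undefined for a perfectly flat signal
            xs.append(math.log(lag))
            ys.append(math.log(mean_diff))
    if len(xs) < 2:
        raise ValueError("need at least two lags with non-zero increments")
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return sxy / sxx


def averaged_exponent(segments, lags=(1, 2, 4, 8, 16)):
    """Average the per-segment exponents to give one value per task condition."""
    exponents = [scaling_exponent(s, lags) for s in segments]
    return sum(exponents) / len(exponents)


# Synthetic check signal (not EEG data): a straight ramp cut into four segments.
ramp = [0.5 * t for t in range(200)]
segments = [ramp[i:i + 50] for i in range(0, 200, 50)]
print(averaged_exponent(segments))  # close to 1.0 for a straight ramp
```

For a non-stationary record the per-segment exponents would differ from one another, which is the source of the ambiguity the abstract mentions; averaging hides that spread, so inspecting the individual values alongside the mean may be worthwhile.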
The Relationship Between Serum
2,3,7,8-tetrachlorodibenzo-para-dioxin Blood Levels and
Psychological Problems: Social Class, Exposure to Toxins and
Mental Distress*


Russell  P.D.  Burton 
Department  of  Sociology 
Kent  State  University 
Kent,  Ohio  44240 


ABSTRACT 

This paper expands on the work of the 1991 AFHS and
explores, with newly constructed dependent measures, with updated
assumptions regarding consistency across measures, and with more
fully specified models, the association between dioxin and mental
distress. Results differ considerably from the 1991 AFHS report.
While by no means conclusive, results produced from these
analyses strongly suggest that the body burden of dioxin is
related to mental distress.
INTRODUCTION


Little is known about the short or long term psychosocial
effects of acute or chronic exposure to
2,3,7,8-tetrachlorodibenzo-para-dioxin (dioxin) in humans. The Air Force
Health Study (AFHS) is the only study which allows the
examination of the long term psychosocial effects of chronic
exposure to dioxin. The AFHS is an ongoing 20-year longitudinal
study of the current health of Air Force veterans of Operation
Ranch Hand. This unit was responsible for the aerial spraying of
herbicides in Vietnam. One such herbicide, from a general class of
herbicides which contained trace contaminants of dioxin, was
"Agent Orange." These men either flew or maintained the aircraft
or directly handled these contaminated herbicides for a period of
one to three years. Thus, the AFHS, which began in 1982, some 12
years after the discontinued use of these herbicides, presents an
excellent opportunity to examine the long term effects of
exposure to these herbicides and, in particular, their toxic
contaminant dioxin.

The  AFHS  is  an  extremely  large  undertaking  and  in  the  past 
10  years  the  researchers  involved  in  this  project  have  published 
and  presented  several  papers  and  technical  reports  regarding  the 
possible  effects  of  exposure  to  these  contaminated  herbicides 
(e.g.  Wolfe  et  al.  1990;  Michalek  et  al.  1990;  Michalek  et  al. 
1992).  This  project  was  directed  at  the  examination  of  the 
effects  of  dioxin  in  11  general  outcome  areas  with  over  250 
individual  dependent  measures. 
The last technical report represented a marked improvement
in the study of exposure to dioxin (Roegner et al. 1987). Prior
to this report no direct measure of exposure to dioxin was
available. Thus, research was limited to the classic
epidemiological method of comparing those individuals thought to
be exposed (i.e. Ranch Hands) to control individuals (referred to
as comparisons) who were not occupationally exposed to herbicides
or dioxin in Vietnam. However, between 1985 and 1987 a technique
was introduced which allowed the direct measurement of exposure
through the examination of individual blood samples for levels
of dioxin. Because dioxin has a rather long half-life in the body
(about 7 years), current levels and original exposure levels could
be obtained with reasonable accuracy. Thus, an important
methodological improvement was made, and for the first time true
dose-response estimates could be made.
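How an original exposure level might be recovered from a current measurement can be illustrated with a minimal sketch. It assumes simple first-order (exponential) elimination with a constant half-life of about 7 years; the source does not give the formula the AFHS actually used, and the function name `back_extrapolate` and the example values are hypothetical.

```python
def back_extrapolate(current_level, years_elapsed, half_life_years=7.0):
    """Estimate the initial level from a current one, assuming first-order elimination.

    initial = current * 2 ** (years_elapsed / half_life)
    """
    if half_life_years <= 0:
        raise ValueError("half-life must be positive")
    return current_level * 2.0 ** (years_elapsed / half_life_years)


# Illustrative numbers only (not AFHS data): a level of 10 units measured
# 14 years after exposure corresponds to two half-lives of decay.
print(back_extrapolate(10.0, 14.0))  # 40.0
```

Because the estimate grows exponentially with elapsed time, any uncertainty in the assumed half-life is amplified for exposures that ended many years before the blood draw.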

While this improvement in measurement has contributed
immensely to the understanding of the effects of dioxin, a
careful review of the psychological assessment section of the
1991 AFHS report suggests three other areas where additional
improvement could be made. First, dependent measures, which were
designed to examine psychological problems, should be constructed
consistent with their theoretical development. Second, outdated
assumptions regarding consistencies across three different types
of psychological instruments should be dropped. Finally, research
models should be constructed based on current theoretical
foundations in the sociology of mental distress. The purpose of
this paper is to show that after these changes are implemented
the relationship between dioxin and psychological problems will no
longer be "inconsistent and isolated", as stated by Roegner, et
al. (1991: 1..9), but consistent and prevalent.

This paper is two-fold in its focus. First, research models
based on current research regarding mental distress are
constructed. Second, the 1987 AFHS data are reanalyzed. The
results are discussed in light of the previous theoretical
arguments and hypotheses.

RESEARCH  MODELS 

While most models are misspecified to a certain extent,
misspecification of research models leads to biased estimates and
should be avoided wherever possible (Pedhazur, 1982). In the
psychological section of the AFHS 1991 technical report, only
five independent variables were considered as candidates in the
construction of the research models: age, race, education,
current alcohol consumption and history of alcohol consumption.

A  brief  review  of  the  literature  of  the  epidemiology  and 
sociology  of  mental  distress  (see  below)  suggests  that  other, 
common  but  critical,  independent  variables  should  have  been 
included  (e.g.  individual  income). 
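The bias that results from leaving out a relevant predictor can be seen in a small synthetic example. The numbers below are invented purely for illustration and are not AFHS data; `x1` stands for an included predictor, `x2` for an omitted one that is correlated with it, and the helper `ols_slope` is a hypothetical name for an ordinary least-squares slope.

```python
def ols_slope(x, y):
    """Slope of the simple least-squares regression of y on x."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx


# Synthetic data: the true model is y = 1.0 * x1 + 2.0 * x2, with no noise.
x1 = [0.0, 1.0, 2.0, 3.0]
x2 = [0.0, 1.0, 0.0, 1.0]   # positively correlated with x1
y = [1.0 * a + 2.0 * b for a, b in zip(x1, x2)]

# Regressing y on x1 alone absorbs part of the effect of the omitted x2:
# estimated slope = 1.0 + 2.0 * cov(x1, x2) / var(x1).
biased = ols_slope(x1, y)
expected = 1.0 + 2.0 * ols_slope(x1, x2)
print(biased, expected)
```

In this toy case the slope on `x1` comes out larger than its true coefficient of 1.0 because `x1` is standing in for the omitted `x2`, which is the kind of distortion a more fully specified model is meant to avoid.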

SOCIAL  STATUS  AND  MENTAL  DISTRESS 

One of the strongest and most consistent relationships noted
in the epidemiology of mental distress is that social status is
negatively related to mental distress (Kessler, 1982)¹. Two
general arguments have been developed as possible explanations
for this observed relationship; they are termed social drift and
social causation.

The drift or selection argument suggests that psychological
disorders result in a drifting of mentally distressed individuals
to lower social status positions or the climbing up in social
status of non-mentally distressed individuals. In both cases the
result is a residue of mentally distressed individuals at the
lower social status positions. Alternatively, however, the
social causation argument suggests that there is something
inherent in the lower social positions, such as traumatic life
events or low intellectual flexibility, which causes mental distress.
Again, the lower social statuses have higher levels of mental
distress. While both arguments are compelling, the data and
theoretical details required to sort out such discussions are
extremely complex. However, the 1982 work of Kessler has, to
some extent, helped clarify this relationship.

Kessler utilized eight broad-based normal population surveys
in his examination of the effects of differing aspects of social
status on mental distress (Kessler 1982). Considering individual
income, family income, education, and occupational prestige,
Kessler noted that for males the strongest and essentially the
only social status predictor of mental distress was that of
individual income (Kessler, 1982). For females, however, the
strongest effect was demonstrated by education. Kessler
concludes his paper by linking these results to the classic drift
vs. social causation debate.

¹ From the outset we should distinguish social status from
social class; the latter refers to one's relationship to the
means of production, and by the former we limit our discussion to
education, income, and occupational status.

Kessler notes that because the social selection argument
emphasizes achieved social status, such as individual income or
occupational prestige, the results obtained for men in his eight
surveys are more consistent with the social selection argument
than with the social causation argument. That is, if achieved
social status is related to mental distress, one would anticipate
that individuals move beyond their more "ascribed" social status
(i.e. education), and in doing so one would then observe that
income becomes a stronger predictor of mental distress than
education.

Alternatively, because several aspects of the social
causation argument emphasize socialization processes which occur
relatively early in the life course, such as intellectual
flexibility or coping responses, Kessler suggests that the social
causation argument may be more consistent with his results noted
for women (Kessler, 1982). That is, if there is something
inherent about being in the lower social statuses, such as
receiving socialization which results in low intellectual
flexibility, and this intellectual flexibility has an impact on
how one responds to stress, and if this intellectual flexibility
is relatively fixed for life, one would then expect that
education would be a stronger predictor of mental distress than
income.

Thus, because the data for this paper include only men, we
would anticipate, whether or not one accepts Kessler's social
selection and causation explanation, that income would be the
strongest and only social status predictor of mental distress.

SOCIAL ROLES AND MENTAL DISTRESS

One of the more recent areas of investigation in the
epidemiology of mental distress is that of social roles. Work by
Thoits (1983, 1986), Menaghan (1989), Burton, Rushing and Ritter
(1992) and others has demonstrated that, in general, social
roles such as being married and employed are negatively related
to mental distress. While there are some differences by race and
gender, the overall results suggest that the social roles of
being married and employed are beneficial to mental health
(Reskin and Coverman 1985; Mirowsky and Ross 1989a). However,
being a parent and having the children living in the household
seems to be positively related to mental distress for women but
unrelated to mental distress for men (Mirowsky and Ross 1989a).

Thus, in the AFHS one would expect the social roles of
employee (i.e. working) and spouse (i.e. being married) to be
strongly and negatively related to mental distress. One would
also anticipate that being a parent and having the child in the
home would not be related to mental distress or would be slightly
positively related to mental distress.

ALCOHOL CONSUMPTION AND MENTAL DISTRESS

The relationship between alcohol consumption and mental
distress is both theoretically and empirically unclear.
Theoretically, individuals who are depressed, stressed or anxious
may be motivated to drink alcohol to alleviate these unwanted
feelings (Williams 1970). Thus, current drinking, and particularly
problem drinking, may be associated with depression and anxiety.
Recent research indicates that at least in the immediate short
term (i.e. while intoxicated), in particular situations and
at particular times of the week and the day, some stress
reduction may occur (Orcutt 1991). However, long-term drinking,
particularly at higher levels of use, may result in an
exacerbation of mental distress rather than a reduction
(Biener, 1987).

While the relationship is somewhat unclear, given the
cross-sectional nature and the time frame of these data, we would
expect that both current drinking and drinking history should be
positively related to mental distress.

HYPOTHESES 

The following anticipated relationships should be
demonstrated with these data.

1.) Income will not only be negatively related to mental distress
but will be the only aspect of social status which is related
to mental distress.

2.) Conversely, education will have no effect on mental distress.

3.) Two social roles, that is, being an employee (i.e. working)
and being a spouse (i.e. married), will be negatively related to
mental distress.

4.) The social role of being a parent (i.e. having a child) will
not be related to mental distress or will have a marginal
positive effect.

5.) Both current drinking and history of drinking will be
positively related to mental distress.

6.) Dioxin will have a positive effect on mental distress.

During the Vietnam War a group of individuals were involved
in a military operation code named Operation Ranch Hand. The
overall responsibility of these individuals, from January 1962 to
October 1971, was the aerial spraying of defoliant herbicides.
The primary herbicide, one of a general class of herbicides which
contained trace contaminants of dioxin, was code named Agent
Orange. The AFHS is an ongoing epidemiological investigation to
determine the adverse effects of exposure to these herbicides for
the Ranch Hand group. The AFHS was designed as a 20-year
longitudinal study starting in 1982, with 5 follow-ups to occur at
3 years (1985), 5 years (1987), 10 years (1992), 15 years (1997),
and 20 years (2002). What follows is a brief explanation of the
population and sampling procedures employed (for more information
see Roegner et al. 1991).

All Ranch Hands were matched to at least one Comparison, and
to as many as 10 Comparisons, based on age, race, and military
occupation. The Comparison subjects were drawn from a group of
Air Force veterans involved in C-130 aircraft missions in
Southeast Asia. It was determined that these individuals were
suitable as Comparisons because they received similar training,
served in Southeast Asia, were available in sufficient numbers,
and conducted similar work, except that they were not
occupationally exposed to herbicides.

RESULTS

Tables 1 through 3 present results from the analysis of the
SCL-90-R. Note that all regression results presented throughout
this paper, except those noted, are in their unstandardized
format. Therefore, direct comparisons may be made on the same
variable across different populations with the same dependent
variable. Comparisons may not be made, however, across different
variables within the same or different populations, irrespective
of the dependent variable. Table 1 displays the number of valid
responses, the means and standard deviations on all variables for
all Ranch Hands and all Comparisons. These two groups were
matched on age, race and education, and we find no significant
variation in these variables across the two groups. In general,
no variables except drinking history, combat exposure and
dioxin levels differ significantly across the two groups.
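The distinction between unstandardized and standardized coefficients
can be illustrated with a short sketch. This is not the author's
code. The slope b_inc is a made-up value chosen only for
illustration; the standard deviations are the Ranch Hand INC and GSI
values from Table 1; and the conversion beta = b * sd_x / sd_y is the
usual textbook relationship, assumed here rather than taken from the
paper.

```python
def standardized_beta(b, sd_x, sd_y):
    """Convert an unstandardized slope b into a standardized coefficient."""
    if sd_y == 0:
        raise ValueError("sd_y must be nonzero")
    return b * sd_x / sd_y


# Hypothetical unstandardized slope (NOT a result from this paper):
# a one-unit rise in INC lowers the GSI score by 10 points.
b_inc = -10.0

# Standard deviations from Table 1 (Ranch Hands): INC and GSI.
sd_inc, sd_gsi = 4.174, 471.208

beta_inc = standardized_beta(b_inc, sd_inc, sd_gsi)
print(round(beta_inc, 4))
```

Because the standardized value depends on the standard deviations
within each group, the same unstandardized slope can yield different
standardized coefficients in different populations, which is why the
unstandardized form is the one compared across groups.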

Table 2 Panel A introduces the multivariate analysis of the
SCL-90-R for the Comparison group, and Table 2 Panel B displays
the multivariate equations for all Ranch Hands. Within each
cell the superscripts a-d represent the probability values, from
most to least stringent (a = .001; b = .01; c = .05; d = .1), for
testing the hypothesis that the slope for the variable in that
cell is equal to zero. A superscript e in a cell indicates that
the slope for that variable is significantly different (p < .1)
for Ranch Hands and Comparisons; that is, there exists an
interaction between that variable and group membership (i.e.
Ranch Hands vs. Comparisons). Additionally, the R² for every
equation is reported at the far right of each row.
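The superscript coding described above can be sketched as follows.
The cutoffs come from the text. The slope comparison is a large-sample
z-test for two independently estimated slopes, which is an assumption
made for illustration and not necessarily how the interaction terms in
Table 2 were tested. All numeric inputs are hypothetical.

```python
import math


def significance_letter(p):
    """Return the Table 2 style superscript (a-d) for a p-value, or ''."""
    for letter, cutoff in (("a", 0.001), ("b", 0.01), ("c", 0.05), ("d", 0.1)):
        if p < cutoff:
            return letter
    return ""


def slope_difference_p(b1, se1, b2, se2):
    """Two-sided p-value for H0: the two slopes are equal.

    Uses z = (b1 - b2) / sqrt(se1^2 + se2^2) with a normal reference
    distribution (an illustrative assumption, see the lead-in).
    """
    z = (b1 - b2) / math.sqrt(se1 ** 2 + se2 ** 2)
    return math.erfc(abs(z) / math.sqrt(2))


# Hypothetical slopes and standard errors for one variable in each group.
p_diff = slope_difference_p(b1=-12.0, se1=4.0, b2=3.0, se2=5.0)
flag = "e" if p_diff < 0.1 else ""
print(significance_letter(0.03), flag)
```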

Recalling that these two groups were matched on age, race
and level of education, the most important information to be
drawn from Table 2 is the overall differences between the
equations in Panel A and Panel B. There are two general levels
of differences across these two panels. First, there are the
differences which are statistically significant across the two
groups (i.e. the cells with e in them). Second are the
nonsignificant differences which demonstrate consistent patterns
of differences across the two groups. While the first level of
differences is clearly what we should rely on to draw our
conclusions, at this early stage in the analysis it is sometimes
also helpful to note consistent trends in the data. Both at
the level of significance and at the trend level we note striking
differences in these two sets of equations, i.e. Panel A and
Panel B.

By far the most obvious difference across these two matched
groups is the differential effects of education and income, with
eleven of the twelve possible interaction terms for education
being significant and seven of the ten possible interaction terms
for income being significant. Panel A demonstrates the
hypothesized relationships, for a group of males, between
individual income, education and mental distress. That is,
drawing on the work of Kessler (1982), we anticipated that
individual income would be, if not the only, the major social
status predictor of mental distress (see hypotheses 1 and 2
above). However, we note that the relationship is completely
reversed for the Ranch Hands. That is, for the Ranch Hands,
education is in most cases the only, and in all cases the
strongest, social status predictor of mental distress
(standardized coefficients not shown). We begin to note the first
in a series of differences between these two groups and thus the
potential effects of exposure to Agent Orange on the Ranch Hands.

Directing the focus to the effects of social roles, it is
clear that the next significant difference between these two
groups of equations is the extremely strong, positive and
consistent effect of being a parent on mental distress observed
in the Ranch Hand group but not in the Comparison group. In
fact, of the ten possible interaction terms for the parent role
across these two equations, six are significantly different.
Thus, it seems once again that the hypothesized effects of the
independent variables in the model (hypotheses 3 and 4) are
supported for the Comparison group but are not supported, for
the parent and working roles, in the Ranch Hand group.

In reference to drinking history, the effects in both groups
are as anticipated. Current drinking has little effect on mental
distress in both groups, but the effects are reversed by group.
For the Comparison group current drinking is positively related
to mental distress, as anticipated, but for the Ranch Hands
current drinking is negatively related to mental distress. It is
noted again that support for the current drinking hypothesis
(i.e. hypothesis 5) is found in the Comparison group but not in
the Ranch Hand group.

To review the results thus far, it is clear that the Ranch
Hand and Comparison groups, although matched on age, military
occupation and race, produce completely different equations
regarding the prediction of mental distress. Significant
differential effects, by group, on mental distress for education,
income and the parent role, as well as a strong but
nonsignificant trend in the differential effect of the employment
role, have been noted. Because a critical covariate which was
not adjusted for across these two groups is dioxin level,
assumed to be the result of exposure to Agent Orange, it is
possible, though by no means established, that exposure to Agent
Orange and the resulting dioxin levels may be driving these
differential effects. The effects of dioxin are considered next.

Table 3 presents these equations. The most interesting
aspect of these equations is the fact that dioxin has a
significant positive effect for almost all mental distress
outcomes. In fact, dioxin is a marginal predictor (p < 0.1) in 3
of the 12 equations and a strong predictor (p < 0.05) in 6 of
the remaining 9 equations. Such effects are beyond any
limits set by random variation with several outcome variables, and
thus we find clear support for hypothesis 6 regarding the
positive effects of dioxin on mental distress. Current drinking
is negatively related to mental distress. Additionally, drinking
history is positively related to mental distress. A marginal and
negative effect of education and marital status is also noted.

Thus, a clear dose-response relationship between mental distress
and dioxin appears to exist at 15 PPTR and greater. At this point it
is clear that dioxin body burdens of greater than 15 PPTR have a
significant effect on mental distress.

CONCLUSION 

In conclusion, this paper has expanded on the work of the 1991
AFHS and explored, with newly constructed dependent measures,
with updated assumptions regarding consistency across measures,
and with more fully specified models, the association between
dioxin and mental distress. Results differ considerably from the
1991 AFHS report (see Roegner et al. 1991). While by no means
conclusive, the results produced from these analyses strongly
suggest that the body burden of dioxin is related to mental
distress.

TABLE 1

DESCRIPTIVES FOR ALL VARIABLES
FOR ALL RANCH HANDS AND COMPARISONS

                    RANCH HANDS                   COMPARISONS
Variable      N      Mean        SD         N      Mean        SD
SOMAT        854   828.899    470.615      783   808.203    464.449
OBSES        854   828.296    468.450      783   808.061    462.697
INTERP       854   815.391    448.333      783   822.937    459.310
DEP          854   815.708    461.500      783   822.590    221.153
ANX          854   820.937    440.809      783   816.804    440.089
HOST         854   826.197    435.092      783   811.150    433.401
PHOB         854   816.383    343.322      783   821.854    358.330
PAR          854   809.994    417.098      783   828.822    433.060
PSYC         854   814.846    416.376      783   823.531    428.853
GSI          854   817.227    471.208      783   820.934    471.776
PSDI         854   819.901    463.231      783   818.017    456.972
PST          854   817.369    468.264      783   820.779    475.382
EDUC         848      .482       .500      778      .500       .500
AGE          854    43.796      7.538      783    44.027      7.956
BLACK        854      .052       .221      783      .060       .238
DRINK Y      846     2.236*     1.950      781     2.438      1.817
DRINK D      850    -1.010      1.181      783     -.920      1.209
INC          854     7.054      4.174      783     6.913      3.980
COM          852     4.937*     2.311      772     3.812      2.446
WORK         854      .871       .335      783      .870       .337
MAR          854      .840       .367      782      .857       .351
PAR          854      .453       .498      783      .469       .499
LOGPPT       854     3.868*     1.637      783     1.820      1.337

* = Significant Differences Across Groups (P<.1)

In Utero Taste Aversion Conditioning in the Rat Fetus


Jed S. Cohen

Summer Research Assistant
Trinity University


Abstract 


The ability to teach rat fetuses a taste aversion in utero was
examined. Fetuses were conditioned by exposing them to a 0.15%
saccharin solution and to a 0.8% lithium chloride solution. Initial
experimental results indicate that saccharin consumption in all
groups was low, possibly due to neophobia. In later test weeks,
however, the saccharin and lithium chloride treatment group was
found to consume greater quantities of saccharin. Although this
group consumed much less saccharin than the other groups, it
consumed far more than expected for animals with a conditioned
taste aversion. If the protocol did lead to a successful taste
aversion, its presence was very subtle. As of this writing, the
protocol is being adjusted to ensure the presence of a taste
aversion. Future experiments will focus on the ability to transfer
the taste aversion from a trained fetus to a naive adult male rat
in order to refine neural transplantation techniques.

INTRODUCTION


It has been shown that rat pups are capable of odor and taste
aversive learning while still in utero. Stickrod, Kimble, and
Smotherman (1982) have shown that aversive learning abilities
develop prenatally. They have reported that rat fetuses will form a
taste/odor aversion when exposed to apple juice in utero followed
by lithium chloride. Stickrod further demonstrated that fetal rats
can acquire and retain a conditioned taste aversion. He showed that
rat fetuses at 20 days of gestation are capable of associative
learning which can be demonstrated more than two weeks after
birth. The purpose of this experiment is to confirm whether or not
a taste aversion can be learned in utero and to assess its
effectiveness.

We are expanding this original paradigm by initiating conditioning
on the 18th day of gestation, testing more than 20 days after birth,
and relying on different concentrations of the conditioned stimulus
and unconditioned stimulus to determine their effects. Subsequent,
more ambitious experiments will examine the feasibility of using
neural transplant techniques to transfer a simple memory (the taste
aversion) from a trained fetus to a naive adult male rat in order to
elucidate how the brain encodes memory, refine neural
transplantation techniques, and allow for a description of the neural
components of a tissue containing a specific memory.

METHOD 

This experiment was conducted to confirm that a taste aversion
could be learned in utero. In order to expose the fetuses to the
CS (conditioned stimulus: saccharin) and the UCS (unconditioned,
aversive stimulus: lithium chloride), surgery was performed on
pregnant, time-mated Sprague Dawley rats on Day 18 of gestation.
The pregnant female was anesthetized with sodium pentobarbital
(60 mg/kg). Metophane was administered via nose cone as needed
to maintain surgical anesthesia. The abdomen was shaved and
treated with betadine scrub and solution. Both horns of the uterus
were exposed through a midline laparotomy under aseptic
conditions. A 32-gauge needle was used to inject 10 microliters of
the 0.15% saccharin (CS) solution or distilled water (control) into
the amniotic sac adjacent to the nose-mouth area of each fetus. The
fetuses were then intraperitoneally injected with 5 microliters of
0.81% lithium chloride (UCS; 8.1 g LiCl per 1000 ml distilled
water) or 0.09% saline (control). This dose of lithium chloride is
known to cause gastrointestinal difficulty in the fetus and is
therefore used as the aversive stimulus in establishing the taste
aversion. All injections were done under visual guidance. The
uterus was replaced after the last injection and maneuvered as
close as possible to its original U-shape to prevent any delivery
complications at birth. The incision was sutured and treated with
Lidocaine, a topical analgesic, to reduce immediate post-surgical
pain. The fetuses were allowed to come to term. Conditioned male
rats were weaned and placed in their own cages 25 days after birth.
At this time, the dam and the conditioned females were
euthanized using carbon dioxide gas.
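The quantities delivered per fetus follow directly from the
concentrations and volumes given above. The sketch below assumes the
percentages are weight/volume (grams per 100 ml), which agrees with the
stated 8.1 g per 1000 ml for the 0.81% solution; the function name is
illustrative only.

```python
def injected_mass_ug(percent_w_v, volume_ul):
    """Mass of solute, in micrograms, contained in volume_ul microliters
    of a solution whose strength is percent_w_v (g per 100 ml)."""
    mg_per_ml = percent_w_v * 10.0   # 1% w/v = 10 mg/ml
    ug_per_ul = mg_per_ml            # 1 mg/ml is the same as 1 ug/ul
    return ug_per_ul * volume_ul


licl_ug = injected_mass_ug(0.81, 5)    # UCS: 5 microliters of 0.81% LiCl
sacc_ug = injected_mass_ug(0.15, 10)   # CS: 10 microliters of 0.15% saccharin
print(f"LiCl per fetus: {licl_ug:.1f} ug; saccharin per fetus: {sacc_ug:.1f} ug")
```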

Four different treatment groups (TGs) were established. TG1
received saccharin into the amniotic fluid followed by lithium
chloride injected intraperitoneally (SAC/LiCl). TG2 received distilled
water into the amniotic fluid followed by saline (H2O/SAL). TG3
received saccharin into the amniotic fluid followed by saline
(SAC/SAL). TG4 received distilled water into the amniotic fluid
followed by lithium chloride (H2O/LiCl).

The taste aversion testing of the conditioned males was begun
immediately after weaning. The rats were allowed to eat and drink
ad lib for one week after weaning, before being placed on a
timed drinking schedule. Once they were on the drinking cycle,
their water bottles were removed at 2 P.M. every Monday. Water was
then available for only 30 minutes per day, at the same time
every day, for three days (Tuesday, Wednesday, and Thursday). This
scheduling was required to ensure the rats would drink when liquids
were offered to them on the test day. On day #4 of the cycle
(Friday: Test Day #1), the rats were offered a choice between
distilled water and the saccharin solution described earlier. The
two bottles containing the two liquids were placed at opposite ends
of the cage top. The drinking solutions were placed in 50 ml
centrifuge tubes (Nalgene); each tube was then stoppered with a
rubber test tube stopper containing a sipper tube. The left and right
bottle positions were switched three times during the drinking
period (at 30 seconds, 2.5 minutes, and 3.75 minutes) in order to
eliminate side preferences and fixations. The amount of each liquid
consumed was recorded at the end of the 30-minute drinking
period.


RESULTS 

The data collected after each test day were organized and then
entered into RS/1, a scientific applications spreadsheet. The data
were organized along several lines: the rats were identified by
number, the type of injections they received, the date the test
occurred, the amount of water and saccharin each rat consumed,
and the percentage of water and saccharin consumed by each of the
rats. The most important indicator of a successful taste aversion
(TA) was the percentage of saccharin consumed by each rat during
the test day. Data interpretation relied heavily on this figure.
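The key figure, percentage of saccharin consumed, can be sketched as
follows. The report does not spell out the formula, so this assumes the
percentage is saccharin intake divided by total fluid intake (saccharin
plus water) for the 30-minute session. The intake values are
hypothetical.

```python
def percent_saccharin(saccharin_ml, water_ml):
    """Saccharin intake as a percentage of total fluid intake."""
    total = saccharin_ml + water_ml
    if total <= 0:
        return None  # the rat drank nothing, so preference is undefined
    return 100.0 * saccharin_ml / total


# Hypothetical (saccharin, water) intakes in ml for three rats on one test day.
intakes = [(2.0, 6.0), (1.0, 9.0), (0.0, 0.0)]
scores = [percent_saccharin(s, w) for s, w in intakes]

# Average only over rats with a defined score.
valid = [x for x in scores if x is not None]
group_mean = sum(valid) / len(valid)
print(scores, group_mean)
```

Under this definition a score well below 50% indicates a preference for
water, which is the pattern expected from a rat with a conditioned
aversion to saccharin.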

Regardless of the type of injections the rats received while in
utero, their initial saccharin consumption in all treatment groups
was low (see graph #TA_EXP1). Lt. Col. Mickley attributed this
phenomenon to neophobia -- although the rats have been exposed
repeatedly to water prior to the test day, the test day is the first
opportunity the rats have had to consume saccharin at will. Thus,
they tend to consume the more familiar substance (water) in greater
quantity early in the experiment. Interestingly enough, this
phenomenon is also seen in the control animals which have received
saccharin in utero with saline. Why would these animals
demonstrate a neophobia to saccharin when they have had some
experience with saccharin? Although we cannot explain it directly,
this evidence does lend some credence to the neophobia argument.

The data specifically show a neophobia for the saccharin/saline
(SAC/SAL) treatment group (control). During test week #1, for
example, the SAC/SAL control group had a saccharin consumption
rate of 16% that was surprisingly close to the H2O/SAL
control group's consumption rate of 13%. (The H2O/SAL group
was expected to show the lowest saccharin consumption rate
because it has never been exposed to saccharin; this assumption
should hold true if one gives credence to the notion of neophobia.
If the taste aversion conditioning were working, the SAC/LiCl group
should show the lowest saccharin consumption.)

Also unexpected was our discovery that the rats which received
the lithium chloride (LiCl) injections were consuming more
saccharin than their counterparts which received the saline control
injections (see graph #TA_EXP1). During test week #1, the SAC/LiCl
group consumed 25.9% saccharin while the H2O/SAL group -- the
group expected to have the highest saccharin consumption -- had a
13.4% saccharin consumption rate. During test week #2, the
SAC/LiCl group consumed 46% saccharin while the H2O/SAL group
consumed only 22.3% saccharin. The H2O/SAL group almost
manages to tie the SAC/LiCl group during test week #5 (64%
vs. 64.7%) and actually manages to consume more saccharin during
test week #7 (71.9% vs. 70.2%). See graph #TA_STAT for details.
Based on this evidence, it is possible that LiCl, administered to
cause gastrointestinal distress, may actually have an amnesic
effect on the rats.

The clearest separation between the saccharin/LiCl group and the
control groups (H2O/SAL and SAC/SAL) occurs on test week #3 (see
graph #TA_EXP1). Notice that the SAC/LiCl group has the greatest
saccharin consumption rate (67.7%) compared with the H2O/SAL
(36.1%) and the SAC/SAL (47.2%) groups. Thus, the group that
should demonstrate the taste aversion consumed more saccharin
than the control groups.

The only finding consistent with the original TA hypothesis was
discovered by Lt. Col. Mickley when he plotted the slopes and
intercepts of each of the lines in the #TA_EXP1 graph on RS/1. The
SAC/LiCl group exhibits an early saccharin consumption peak
followed by a decrease in consumption. The slope for this line is
shallower than those of the control groups or of the H2O/LiCl
group. The shallow slope implies that the SAC/LiCl rats do not
consume as much saccharin in the long term as do the other rats,
because their consumption rate remains relatively stable for the
duration of the eight-week test cycle. Saccharin consumption rates
for the other three treatment groups are more sporadic, but
generally show a small initial saccharin consumption followed by an
increased saccharin consumption later in the test cycle.
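The slope comparison can be approximated from the group means reported
in the Summary of Taste Aversion Data (MEAN rows for treatments 1-4,
test weeks 1-7). The sketch below fits an ordinary least-squares line
to each set of means. This is an assumption about the procedure: the
original fit was done in RS/1, the report does not say whether it used
group means or individual rats, and the table lists seven test weeks
although the text mentions an eight-week cycle.

```python
def fit_line(ys):
    """Ordinary least-squares slope and intercept of ys against weeks 1..n."""
    n = len(ys)
    xs = range(1, n + 1)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


# Group mean proportion of saccharin consumed, test weeks 1-7.
means = {
    "SAC/LiCl": [0.258929, 0.460945, 0.677562, 0.693538, 0.647635, 0.700657, 0.702652],
    "H2O/SAL":  [0.134616, 0.223457, 0.360902, 0.567650, 0.640472, 0.668475, 0.719506],
    "SAC/SAL":  [0.162959, 0.360249, 0.472073, 0.559031, 0.645937, 0.736037, 0.822441],
    "H2O/LiCl": [0.235453, 0.435434, 0.545553, 0.659269, 0.686635, 0.791764, 0.820259],
}

slopes = {}
for group, ys in means.items():
    slope, intercept = fit_line(ys)
    slopes[group] = slope
    print(f"{group:9s} slope={slope:.4f} intercept={intercept:.4f}")
```

On these means the SAC/LiCl line has the smallest fitted slope (about
0.064 per week, against roughly 0.09 to 0.10 for the other three
groups), which agrees with the description above.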

Because the initial results were not as promising as expected, the
protocol was adjusted slightly, with the concentrations of the
saccharin and lithium chloride solutions being doubled so that
their effects could be more easily seen. Early data suggest that
the double-concentration groups (which contained only a few rats)
consume very low levels of saccharin. The consumption rates are so
low, in fact, that differentiation between the experimental and
control groups is virtually impossible.


CONCLUSION 

If  a  taste  aversion  was  indeed  developed,  its  presence  was  very 
subtle.  The  only  evidence  we  found  to  support  the  original 
hypothesis  was  that  the  SAC/LiCl  treatment  showed  an  early 
saccharin  consumption  peak  followed  by  a  decrease  in 
consumption.  Nevertheless,  the  SAC/LiCl  group  consumed  a  great 
deal  of  saccharin  for  animals  that  should  have  a  taste  aversion  and, 
therefore,  consume  little,  if  any,  saccharin  at  all. 

The experimental protocol is being modified so that a taste
aversion can be better demonstrated, as in Smotherman's studies.
Taste aversion conditioning is now being done on the 20th day of
gestation instead of the 18th. Lower saccharin concentrations
(0.1%) are being used for training and testing purposes. Apple juice
is also being used in lieu of the saccharin in some cases, an idea
borrowed from Smotherman et al. Next, the in utero saccharin and
lithium chloride injections are being separated in time. Initially,
each fetus was injected with saccharin in the mouth area followed
immediately by lithium chloride in the gut. Now, three fetuses are
injected with saccharin and then the lithium chloride injections are
administered. Additional planned changes include weaning the
pups earlier so that taste aversion testing can commence sooner,
adding a one-bottle (saccharin only) test which may reveal effects
obscured by neophobia, and, finally, testing the animals more times
during the week.
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Summary  of  Taste  Aversion  Data 


Row  Rat No  Treatment  %SAC_T1   %SAC_T2   %SAC_T3   %SAC_T4   %SAC_T5   %SAC_T6

  1  MEAN    1          0.258929  0.460945  0.677562  0.693538  0.647635  0.700657
  2  SEM                0.041523  0.068912  0.058358  0.038676  0.056137  0.041601
  3  +SEM               0.300452  0.529856  0.735920  0.732214  0.703772  0.742259
  4  -SEM               0.217406  0.392033  0.619205  0.654861  0.591498  0.659056

  6  MEAN    2          0.134616  0.223457  0.360902  0.567650  0.640472  0.668475
  7  SEM                0.016061  0.031629  0.051438  0.061023  0.063779  0.066369
  8  +SEM               0.150677  0.255086  0.412340  0.628673  0.704251  0.734844
  9  -SEM               0.118555  0.191828  0.309465  0.506628  0.576692  0.602107

 11  MEAN    3          0.162959  0.360249  0.472073  0.559031  0.645937  0.736037
 12  SEM                0.018454  0.034077  0.043311  0.053906  0.054435  0.056543
 13  +SEM               0.181412  0.394326  0.515384  0.612937  0.700372  0.792580
 14  -SEM               0.144505  0.326173  0.428762  0.505125  0.591503  0.679494

 16  MEAN    4          0.235453  0.435434  0.545553  0.659269  0.686635  0.791764
 17  SEM                0.030108  0.055480  0.046606  0.043510  0.041067  0.038410
 18  +SEM               0.265561  0.490913  0.592159  0.702779  0.727702  0.830174
 19  -SEM               0.205346  0.379954  0.498947  0.615759  0.645568  0.753354

 21  MEAN    5          0.183036
 22  SEM                0.031260
 23  +SEM               0.214286
 24  -SEM               0.151786

 26  MEAN    6          0.135376
 27  SEM                0.028954
 28  +SEM               0.164331
 29  -SEM               0.106422

Treatment: 1=Sac+LiCl, 2=H2O+Sal, 3=Sac+Sal, 4=H2O+LiCl,
5=Sac+LiCl (2X conc), 6=Sac+Sal (2X conc),
7=H2O+LiCl (2X conc), 8=AppleJ.+LiCl (2X conc)
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Summary  of  Taste  Aversion  Data 

Row  Rat No  %SAC_T7

  1  MEAN    0.702652
  2  SEM     0.050195
  3  +SEM    0.752847
  4  -SEM    0.652457

  6  MEAN    0.719506
  7  SEM     0.062061
  8  +SEM    0.781567
  9  -SEM    0.657445

 11  MEAN    0.822441
 12  SEM     0.053001
 13  +SEM    0.875442
 14  -SEM    0.769440

 16  MEAN    0.820259
 17  SEM     0.034832
 18  +SEM    0.855091
 19  -SEM    0.785427

 21  MEAN
 22  SEM
 23  +SEM
 24  -SEM

 26  MEAN
 27  SEM
 28  +SEM
 29  -SEM

Treatment: 1=Sac+LiCl, 2=H2O+Sal, 3=Sac+Sal, 4=H2O+LiCl,
5=Sac+LiCl (2X conc), 6=Sac+Sal (2X conc),
7=H2O+LiCl (2X conc), 8=AppleJ.+LiCl (2X conc)
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ANIMAL EMOTIONALITY AND RHESUS MACAQUE (Macaca mulatta)

VOCALIZATIONS 


James  W.  Collins,  M.S. 

Department  of  Psychology 
University  of  Georgia 
Athens,  GA  30605 

Abstract 

A  state-of-the-art  computer-based  bioacoustics  laboratory  was 
established  for  the  recording,  digitization  and  analysis  of 
rhesus macaque (Macaca mulatta) calls, their visual display, and
playback. Two trips were taken to record vocalizations under
naturalistic conditions from a free-ranging rhesus colony (Morgan
Island, South Carolina) maintained by L.A.B.S., in Yemassee,
South Carolina. Also, recordings of rhesus monkey calls under
differing  experimental  and  control  conditions  were  made  from 
animals  at  Brooks  AFB.  The  literature  on  animal  emotionality  was 
reviewed  and  is  reported  in  the  form  of  an  annotated 
bibliography.  The  great  promise  that  research  into  the 
expression  of  emotionality  by  animals  via  their  vocalizations 
holds  for  detecting  the  emotional  state  of  animals,  especially 
primates,  has  not  yet  been  tapped.  It  is  our  aim  to  achieve  a 
higher  level  of  understanding  of  the  emotional  component  of 
animal  vocalizations.  Meeting  this  goal  will  significantly 
expand  our  understanding  of  the  internal  states  of  animals  as 
well  as  enable  us  to  comprehend  future  assessments  of 
research/captive  animal  wellbeing. 
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ANIMAL  EMOTIONALITY  AND  RHESUS  MACAQUE  (Macaca  mulatta) 

VOCALIZATIONS 


James  W.  Collins,  M.S. 


Department  of  Psychology 
University  of  Georgia 
Athens,  GA  30605 


During my 12-week tenure at the Armstrong Laboratory at Brooks
Air  Force  Base  in  San  Antonio,  Texas,  I  participated  in  research 
projects  within  the  Performance  Extrapolation  Function  Division 
directed  by  Dr.  Michael  R.  Murphy.  We  were  interested  in 
measuring  rhesus  macaque  (Macaca  mulatta)  vocalizations, 
establishing  a  model  vocal  repertoire  for  this  species,  and 
ascertaining  how  these  vocalizations  could  be  used  as  an  index  of 
"emotionality"  for  rhesus  monkeys. 

The  first  task  was  the  establishment  of  a  state-of-the-art 
computer-based  bioacoustics  laboratory  for  the  recording, 
digitization  and  analysis  of  macaque  calls,  their  visual  display, 
and playback. This involved the purchase, delivery, and assembly
of various individual components into a single, functionally
integrated system. Components of this laboratory are: a
portable  digital  audio  recorder  with  microphone  and  power  pack 
for  field  and  lab  vocalization  recording,  a  digital  cassette 
recorder  (DAT)  for  playback  and  analysis  of  recorded  audio,  a  Kay 
CSL  Model  4300  computerized  speech  lab,  an  IBM  compatible  486  PC 
AT  computer  with  a  SuperVGA  display  monitor,  and  a  laser  printer 
for  hard  copy  of  visually-represented  calls. 

Two  trips  were  taken  to  record  vocalizations  under  naturalistic 
conditions from a free-ranging rhesus colony (Morgan Island,
South Carolina) maintained by L.A.B.S., in Yemassee, South
Carolina.  Also,  recordings  of  rhesus  monkey  calls  under 
differing  experimental  and  control  conditions  were  made  from 
animals  at  Brooks  AFB  during  my  tenure.  The  resulting  audio 
recordings  were  brought  back  to  the  lab  and  dissected,  analyzed, 
grouped  into  acoustic  and  functional  categories,  and  visually 
presented.  This  work  is  still  in  progress. 

The literature on animal emotionality was reviewed (see Appendix
A). Subjects examined in these studies include rodents, domestic
animals, and nonhuman primates. Animal emotionality has been a
topic of study since the 1930s, and many studies have been published.
Most,  however,  are  concerned  only  peripherally  with  the 
"emotionality"  exhibited  by  the  species  under  investigation. 
Rodent  studies  usually  use  an  established  method  of  measuring 
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animal  emotionality  as  the  dependent  variable  in  an  examination 
of  drug  effects  or  the  effects  of  brain  region  ablations. 

Studies  of  emotionality  in  domestic  animals  are  concerned  with 
the  effects  of  emotionality  in  the  comparative  fitness  of  the 
species  for  domestication.  Primate  studies  of  emotionality  often 
only  indirectly  contemplate  the  emotionality  of  the  subjects 
studied, focusing instead on vocalizations produced in a variety
of circumstances. Animal emotionality, a complex subject area,
has  not  yet  received  focused  attention  in  published  literature, 
but  may  soon  with  the  emergence  of  an  international  focus  on 
animal  wellbeing.  Supplemental  bibliographies  are  provided  for 
reference. 

For  as  long  as  there  has  been  an  interest  in  animal  behavior, 
there has been a corresponding interest in the subject of "animal
emotionality". Consideration of animal emotionality as a subject
of scientific study within the field of psychology began in the
1930s, with an examination of emotionality in the rat (Hall,
1934). Further studies have sought to devise techniques for the
measurement of emotionality in rodents (Frances, 1988) as well as
to consider their theoretical importance (Broadhurst, 1957).

Primate  emotionality  has  not  been  as  extensively  studied, 
although  there  has  recently  been  increased  interest  in  this  area 
as a result of considerations of research/captive primate
wellbeing (Barnard & Hou, 1988; Bayne et al., 1992; Chamove,
1989). The emotional components of primate vocalizations have
been addressed by some studies (Masataka, 1989; Robinson, 1967),
while others have used other indices of so-called "emotional
responsiveness" in nonhuman primates (Line et al., 1989; Martin et
al., 1988; Troisi et al., 1991).

An  annotated  bibliography  of  important,  and  sometimes 
controversial,  research  concerned  with  animal  emotionality  was 
prepared (see Appendix B). The time span of research considered
is from 1934 to 1992, but more emphasis is placed on recent (1980s
and 1990s) studies. Subject species of these many papers
include mice and rats of various strains, pigtail (Macaca
nemestrina), rhesus (Macaca mulatta), and Japanese macaques
(Macaca fuscata), and the domestic dog and pig. The Roman
high-avoidance, Roman low-avoidance, Maudsley Reactive, and Maudsley
Non-Reactive rat strains are of special interest here because
they have been selectively bred for differences in behavioral
responses that have been linked to a basic animal "emotionality"
(Broadhurst, 1975). Emotionality is not always the central focus
of  these  papers,  but  is  a  significant  component  of  either  the 
data  collected  or  the  theoretical  orientation  of  the  work. 

Although the rhesus macaque (Macaca mulatta) is the most widely
used nonhuman primate research subject, its vocal repertoire has
been relatively unexplored and has certainly not been described
in any great detail (Rowell, 1962). We believe that the
prime means for the expression of "emotionality" by the rhesus is
its vocalizations. Early research (Robinson, 1967) has shown
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emotionality  to  be  a  prime  component  of  the  vocal  expressions  of 
this  species.  Likewise,  the  emotional  component  of  the 
vocalizations  of  other  species  has  also  been  long  known  (Noirot  & 
Pye,  1969). 

The  great  promise  that  research  into  the  expression  of 
emotionality  by  animals  via  their  vocalizations  holds  for 
detecting  the  emotional  state  of  animals,  especially  primates, 
has  not  yet  been  tapped.  It  is  our  aim  to  achieve  a  higher  level 
of  understanding  of  the  emotional  component  of  animal 
vocalizations.  Meeting  this  goal  will  significantly  expand  our 
understanding  of  the  internal  states  of  animals  as  well  as  enable 
us  to  comprehend  future  assessments  of  research/captive  animal 
wellbeing. 
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THE  PREDICTIVE  VALIDITY  OF  AUTOMATED  PERSONNEL  TESTS  AND  ASVAB 
TESTS  FOR  PERFORMANCE  ON  AIR  FORCE  TUTORS 


David Dickter, Graduate Associate
Mary Roznowski, Assistant Professor

Department of Psychology

Abstract 

In  two  studies,  regression  analyses  provided  evidence  that 
the  computerized  Automated  Personnel  Tests  (APTs)  are  useful 
supplements  to  the  paper-and-pencil  Armed  Services  Vocational 
Aptitude  Battery  (ASVAB)  tests  for  predicting  ability  to  learn 
from  Air  Force  tutors.  Further,  the  APTs  may  even  outperform 
ASVAB  tests  in  predicting  the  criteria  when  used  alone. 
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THE  PREDICTIVE  VALIDITY  OF  AUTOMATED  PERSONNEL  TESTS  AND  ASVAB 
TESTS  FOR  PERFORMANCE  ON  AIR  FORCE  TUTORS 

David  Dickter  and  Mary  Roznowski 


INTRODUCTION 

Computers possess unique capabilities for measuring human
cognitive  information  processing  skills,  including  the  ability  to 
administer  complex  tests  of  working  memory  and  present  graphical 
representations  of  spatial  relations.  These  features  are  not 
available  with  the  traditional  paper-and-pencil  mode  of  testing. 
Indeed,  it  would  be  difficult  to  overestimate  the  potential 
utility  of  computerized  measures  in  conducting  research  on  human 
cognitive  abilities.  Computerized  tests  may  not  only  supplement 
or  replace  paper-and-pencil  tests,  but  may  be  used  in  research 
that  sheds  new  light  on  the  nature  and  measurement  of  cognitive 
information processing and intelligence (Fleishman, 1988; Hunt,
1982; Kyllonen & Christal, 1989; Sternberg, 1986).

An  ongoing  research  effort  that  has  been  developing  novel 
ways  of  measuring  human  cognitive  abilities  is  the  Learning 
Abilities  Measurement  Program  (LAMP),  located  at  the  Air  Force's 
Armstrong  Laboratory.  LAMP  utilizes  computer  administration  of 
cognitive  tasks  in  order  to  measure  abilities  that  include  speed 
of  information  processing  and  working  memory  capacity.  The 
program  maintains  the  dual  goals  of  modeling  cognitive  learning 
and  developing  improved  tests  for  selection  and  classification 
(Kyllonen  &  Christal,  1989). 

The purpose of this study was to measure the predictive
validities of scores from the Automated Personnel Testing (APT)
program, a subset of measures from the Cognitive Abilities
Measurement (CAM) test battery (Kyllonen et al., 1990), as well
as to compare them to those from the paper-and-pencil Armed Services
Vocational Aptitude Battery (ASVAB). Studies were conducted with
two samples. Performance scores from two computerized Air Force
tutors, electricity concepts (in Study 1) and flight engineering
knowledge and skills (in Study 2), served as criteria. These
tutors measure the acquisition of knowledge and skills that are
relevant to many Air Force specialties and are used to study
conceptual learning and skill learning (Campbell, 1990; Christal,
1991; Shute, 1992).

STUDY  1 

In the first study, the APT and the Armed Forces Qualification
Test  (AFQT;  a  composite  based  on  a  subset  of  tests  from  the 
ASVAB)  were  used  to  predict  performance  on  the  Electricity  tutor. 
Data  were  obtained  from  a  database  of  scores  on  computerized 
tests  administered  as  part  of  a  study  of  learning  environments 
(Shute, 1992). Because APT tests were timed examinations, two
summary scores were used: percent correct on all test items (PC)
and median response time on all items (RT). The total time
subjects took to complete the tutor (T-TUT) was recorded.

Mastery was defined as correctly solving several consecutive
problems on a given principle (Shute, 1992). Following the
tutor,  subjects  were  given  a  test  on  these  principles,  and 
accuracy  on  the  test  (POSTACC)  was  measured.  T-TUT  and  POSTACC 
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constituted  the  criteria  used  in  this  study. 


METHODOLOGY 

Data  were  obtained  for  344  subjects  who  completed  the  ASVAB, 
a  battery  of  CAM  tests,  and  a  computerized  tutor  designed  to 
teach  subjects  Ohm's  law  and  other  principles  of  electricity. 

The  tutor  was  self-paced,  allowing  subjects  as  much  time  as 
needed  in  order  to  master  the  material.  Subjects  were  employees 
at  a  local  temporary  service  agency  and  were  paid  for  their 
participation.  Approximately  15%  were  women,  39%  were  White,  43% 
were  Hispanic  and  10%  were  Black. 

Regression  analyses  were  performed  in  order  to  examine  the 
validities  of  the  APT  tests,  as  well  as  the  unique  contributions 
of  the  two  types  of  APT  measures  over  the  AFQT.  The  dependent 
variables were the time spent taking the tutor (T-TUT) and
accuracy on a test of knowledge acquired from the tutor
(POSTACC). The first criterion measures how long subjects took
to acquire an understanding of the principles of electricity.
The second criterion measures the number of correct solutions,
addressing their actual percentage correct rather than competence
alone. In order to keep the number of variables in the
regression equation to a minimum, the AFQT score was used to
represent the ASVAB scores. Given that the AFQT is used for selection
decisions in the Air Force, and that AFQT scores in the dataset
correlated .94 with the sum of the ASVAB scores, this decision
seemed to be well justified.

Two sets of regression analyses were conducted. Due to the
large number of tests in the APT battery (19), the number of
variables  used  in  the  regression  equation  was  minimized  as  much 
as  possible.  In  the  first  set,  the  APT  tests  were  grouped  into 
composites  based  on  six  factors  measured  in  the  battery: 
Processing  Speed,  Working  Memory,  Fact  Learning,  Skill  Learning, 
Induction,  and  General  Knowledge.  Composites  were  formed  by 
adding  together  scores  from  the  quantitative,  verbal  and  spatial 
tests  within  each  factor.  All  APT  tests  were  used  except  two 
General  Knowledge  quantitative  tests,  for  which  data  were  not 
available  at  the  time  of  the  study.  Thus,  six  composites  were 
formed from the mean percent correct scores (PC). These
composites  were  entered  simultaneously  into  the  regression 
equations. 

Because subjects were encouraged to answer individual test
items quickly and had time limits, another set of composites used
the median response time scores (RT). Composites were formed
based  on  the  six  factors,  in  the  same  manner  as  the  PC 
composites.  The  RT  composites  were  entered  simultaneously  into 
the  regression  equations. 
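
The composite procedure described above can be sketched as follows. This is a minimal illustration on synthetic data, not the authors' analysis: the array layout, the helper names make_composite and r_squared, and the randomly generated scores are all assumptions introduced here.

```python
import numpy as np


def make_composite(quantitative, verbal, spatial):
    """Add the three domain scores that belong to one factor (hypothetical helper)."""
    return (np.asarray(quantitative, dtype=float)
            + np.asarray(verbal, dtype=float)
            + np.asarray(spatial, dtype=float))


def r_squared(predictors, criterion):
    """R-square of an ordinary least-squares fit that includes an intercept."""
    y = np.asarray(criterion, dtype=float)
    X = np.asarray(predictors, dtype=float)
    if X.ndim == 1:
        X = X[:, None]
    design = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    residual = y - design @ coef
    return 1.0 - (residual @ residual) / ((y - y.mean()) ** 2).sum()


# Synthetic stand-in data: 6 factors x 3 domains x 300 subjects (all assumed).
rng = np.random.default_rng(0)
n_subjects, n_factors = 300, 6
raw = rng.normal(size=(n_factors, 3, n_subjects))

# One composite per factor; all six enter the regression simultaneously.
composites = np.column_stack(
    [make_composite(*raw[f]) for f in range(n_factors)]
)
criterion = composites.sum(axis=1) + rng.normal(scale=4.0, size=n_subjects)
r2_composites = r_squared(composites, criterion)
```

The same two helpers would apply unchanged to response-time composites; only the input scores differ.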

RESULTS 

Tables  1  through  3  present  the  results  of  regression 
analyses  using  PC  composites,  RT  composites,  and  AFQT  to  predict 
POSTACC.  Tables  1  and  2  report  validities  for  these  scores  for 
POSTACC,  as  well  as  the  incremental  validities  obtained  when  an 
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APT  score  is  added  to  another  APT  score  or  to  AFQT.  Table  3 
reports  the  incremental  validities  for  the  three  types  of  scores 
when  each  is  added  to  the  remaining  two  scores.  In  order  to 
provide  a  complete  report  of  the  data,  the  regression  tables 
include  the  R-squares  adjusted  for  the  differing  numbers  of 
variables  used  for  PC,  RT  and  AFQT. 
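
For readers checking the tables, the reported quantities are conventionally related as shown below. These are the standard textbook forms for hierarchical regression, offered as an assumption about how the values were computed. The symbols N (subjects with complete data), k (predictors in a model), and m (predictors added in a step) do not appear in the report.

```latex
\begin{align*}
R^2_{\text{adj}} &= 1 - \left(1 - R^2\right)\frac{N - 1}{N - k - 1} \\
\Delta R^2 &= R^2_{\text{full}} - R^2_{\text{reduced}} \\
F_{\text{change}} &= \frac{\Delta R^2 / m}
  {\left(1 - R^2_{\text{full}}\right) / \left(N - k_{\text{full}} - 1\right)}
\end{align*}
```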

The  PC  composite  has  the  largest  R-square  for  predicting 
POSTACC  (54.6%  of  the  variance),  followed  by  the  AFQT  (52.7%)  and 
RT  (14.8%).  Percent  Correct  adds  unique  variance  to  both  RT  and 
AFQT (40.3% and 5.5%, respectively). Moreover, PC makes a unique
contribution  beyond  both  of  these  combined  (4.5%).  Clearly, 
Percent  Correct  scores  result  in  a  very  useful  measure,  both  on 
their  own  and  as  an  addition  to  AFQT.  It  is  noteworthy,  however, 
that  the  AFQT  also  shows  incremental  validity  over  the  APT, 
adding  unique  variance  over  PC  and  RT  combined  (3.6%).  Thus, 
both  the  computerized  and  paper-and-pencil  tests  are  useful.  On 
the  other  hand,  response  time  on  APT  tests  is  not  useful  for 
predicting  variance  over  and  above  that  accounted  for  by  accuracy 
scores  (PC  or  AFQT).  (See  Table  3.) 
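
The incremental values quoted in this paragraph follow from differences between the R-squares of nested models reported in Tables 1 to 3. The sketch below redoes that subtraction. The dictionary layout and the function name incremental_r2 are illustrative assumptions, and the inputs are the rounded table values, so a result can differ from a printed change in the third decimal.

```python
# R-square values for the POSTACC criterion as printed in Tables 1 to 3,
# keyed by the set of predictor blocks in the model.
R2_POSTACC = {
    frozenset({"PC"}): 0.546,
    frozenset({"RT"}): 0.148,
    frozenset({"AFQT"}): 0.527,
    frozenset({"PC", "RT"}): 0.551,
    frozenset({"PC", "AFQT"}): 0.582,
    frozenset({"RT", "AFQT"}): 0.542,
    frozenset({"PC", "RT", "AFQT"}): 0.587,
}


def incremental_r2(added, baseline, table=R2_POSTACC):
    """R-square gained when block `added` joins the blocks in `baseline`."""
    baseline = frozenset(baseline)
    full = baseline | {added}
    return round(table[full] - table[baseline], 3)


# Unique variance of each block over the other two combined.
for block in ("AFQT", "PC", "RT"):
    others = {"AFQT", "PC", "RT"} - {block}
    print(block, "added to", sorted(others), "->", incremental_r2(block, others))
```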

Table 1. Validity and incremental validity of composites of PC
and RT. Criterion: Accuracy on test following the tutor (POSTACC)

                     PC        RT        PC added   RT added
                                         to RT      to PC
Multiple R           .739      .384      .742       .742
R Square             .546      .148      .551       .551
R Square Change                          .403*      .005
F Change             56.380    8.121     41.208     .510
Adjusted R Square    .537      .130      .532       .532

* Indicates significance at p<.01. Other incremental F tests are not
significant.


Table 2. Validity of AFQT and incremental validity of PC and RT.

                     AFQT      PC added   RT added
                               to AFQT    to AFQT
Multiple R           .726      .763       .737
R Square             .527      .582       .542
R Square Change                .055*      .016
F Change             318.278   6.173      1.604
Adjusted R Square    .525      .572       .531

Table 3. Incremental validity for AFQT, PC and RT.

                     AFQT added   PC added      RT added
                     to PC, RT    to RT, AFQT   to PC, AFQT
Multiple R           .766         .766          .766
R Square             .587         .587          .587
R Square Change      .036*        .045*         .005
F Change             23.969       4.970         .591
Adjusted R Square    .568         .568          .568

Similar  analyses  were  carried  out  for  the  time  criterion. 
Tables  4  to  6  present  the  results  of  regression  analyses  using 
subjects'  total  time  to  complete  the  tutor  as  the  dependent 
variable. 


Table 4. Validity and incremental validity of composites of PC
and RT. Criterion: Total time spent on the tutor (T-TUT)

                     PC        RT        PC added   RT added
                                         to RT      to PC
Multiple R           .702      .503      .743       .743
R Square             .494      .253      .552       .552
R Square Change                          .299*      .059*
F Change             46.118    16.067    30.930     6.084
Adjusted R Square    .483
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TABLE  5.  Validity  of  AFQT  and  incremental  validity  of  PC  and  RT. 
Criterion: Total time spent on the tutor (T-TUT)

                     AFQT      PC added   RT added
                               to AFQT    to AFQT
Multiple R           .680      .723       .715
R Square             .462      .522       .511
R Square Change                .060*      .049*
F Change             248.129   5.970      4.746
Adjusted R Square    .460      .511       .499


Table 6. Incremental validity for AFQT, PC and RT.
Criterion: Total time spent on the tutor (T-TUT)

                     AFQT added   PC added      RT added
                     to APT       to RT, AFQT   to PC, AFQT
Multiple R           .752         .752          .752
R Square             .566         .566          .566
R Square Change      .014*        .055*         .044*
F Change             8.743        5.834         4.636
Adjusted R Square    .546         .546          .546

As observed for the accuracy criterion (POSTACC), the PC
composite  has  the  largest  R-square.  Percent  Correct  explains 
49.4%  of  the  variance  in  time  spent  using  the  tutor.  AFQT 
explains  46.2%  of  the  variance  in  T-TUT,  and  RT  explains  25.3%. 
Given  that  RT  is  a  latency  measure,  one  might  expect  that  it 
would  account  for  unique  variance  in  T-TUT,  even  though  it  did 
not  add  unique  variance  for  the  accuracy  criterion.  Indeed,  RT 
added  4.4%  unique  variance  to  PC  and  AFQT  combined.  On  the  other 
hand,  Percent  Correct  adds  the  most  unique  variance.  (See  Table 
6.) 

As  in  the  analyses  with  the  POSTACC  criterion,  AFQT  also 
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adds  unique  variance  to  PC  and  RT  combined  in  predicting  the  time 
criterion,  though  the  variance  added  is  small  (1%).  Thus,  in 
both  analyses,  the  APT  tests  add  more  unique  variance  to  AFQT 
than  AFQT  adds  to  them,  and  Percent  Correct  on  the  computerized 
measures  is  the  best  overall  predictor. 

Another  way  of  keeping  the  number  of  variables  in  the 
regression  equation  at  a  reasonable  level  was  to  select  the  best 
of  the  APT  tests  for  analysis.  The  tests  were  selected  based  on 
their  zero-order  correlations  with  the  criteria.  Although  this 
might  appear  to  inflate  the  Multiple  R  for  APT  tests  falsely,  the 
analyses  reported  below  will  show  that  this  was  not  the  case. 

The  results  are  similar  to  those  using  composites  of  APT  scores. 
Moreover,  any  such  method  of  selecting  the  best  tests  would  be 
imperfect.  If  the  APT  tests  correlating  highly  with  AFQT  were 
chosen,  for  instance,  then  the  ability  to  detect  incremental 
validity  of  APT  beyond  AFQT  would  be  jeopardized.  The  present 
method  of  selecting  tests  is  preferable,  since  it  would  be  better 
to  risk  committing  a  Type  I  error  than  to  risk  a  Type  II  error 
and  possibly  overlook  incremental  validity. 

Tests  with  Percent  Correct  scores  that  had  moderate  to  high 
positive  correlations  with  the  accuracy  criterion  (POSTACC)  and 
moderate  to  high  negative  correlations  with  the  time  criterion 
(T-TUT)  were  identified.  Therefore,  tests  were  retained  only  if 
high  PC  scores  corresponded  to  high  POSTACC  scores  and  fast 
learning  on  the  tutor. 

RT  measures  with  moderately  or  highly  negative  correlations 
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with  POSTACC  and  positive  correlations  with  T-TUT  were  also 
identified.  Therefore,  for  RT,  tests  were  retained  only  if  slow 
reaction  time  corresponded  to  poor  accuracy  and  slow  learning  on 
the  tutor.  Tables  7  to  9  present  the  results  of  regression 
analyses  with  POSTACC  as  the  dependent  variable. 
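
The selection rule described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: the report does not say what counts as a "moderate" correlation, so the 0.30 cutoff, the function name select_tests, and the dictionary input format are all hypothetical.

```python
import numpy as np


def select_tests(scores, accuracy, time_on_tutor, kind="PC", threshold=0.30):
    """Return names of tests whose zero-order correlations fit the rule.

    kind="PC": keep tests correlating positively with accuracy and
               negatively with time on the tutor.
    kind="RT": keep tests correlating negatively with accuracy and
               positively with time on the tutor.
    threshold is an assumed cutoff for a "moderate" correlation.
    """
    if kind not in ("PC", "RT"):
        raise ValueError("kind must be 'PC' or 'RT'")
    sign = 1.0 if kind == "PC" else -1.0
    chosen = []
    for name, values in scores.items():
        r_accuracy = np.corrcoef(values, accuracy)[0, 1]
        r_time = np.corrcoef(values, time_on_tutor)[0, 1]
        if sign * r_accuracy >= threshold and -sign * r_time >= threshold:
            chosen.append(name)
    return chosen
```

Because the same data are used to choose the tests and to fit the regression, a rule of this kind can inflate R-square, which is the concern the text raises before comparing the results with the composite method.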


Table 7. PC and RT tests correlating most highly with criteria.
Criterion: Accuracy on test following the tutor (POSTACC)

                     PC        RT        PC added   RT added
                                         to RT      to PC
Multiple R           .750      .497      .752       .752
R Square             .562      .247      .566       .566
R Square Change                          .320*      .004
F Change                                 12.244     .445
Adjusted R Square    .536      .231      .530

Table 8. Validity of AFQT and incremental validity of PC and RT.
Criterion: Accuracy on test following the tutor (POSTACC)

                     AFQT      PC added   RT added
                               to AFQT    to AFQT
Multiple R           .725      .775       .735
R Square             .525      .600       .540
R Square Change                .075*      .015
F Change                       3.178      1.535
Adjusted R Square    .52367    .575       .529

Table 9. Incremental validity for AFQT, PC and RT.
Criterion: Accuracy on test following the tutor (POSTACC)

                     AFQT added   PC added      RT added
                     to APT       to RT, AFQT   to PC, AFQT


Although  the  R-squares  are  of  greater  magnitude  for  the  selected 
PC  and  RT  scores,  most  increases  are  less  than  2%.  The  exception 
is  RT,  which  performs  much  better  using  the  selection  method  than 
using  composites  (.25  versus  .15).  In  addition,  although  the  R- 
squares  for  PC,  RT  and  AFQT  are  somewhat  larger  when  the  best 
tests  are  used,  the  trends  among  them  are  comparable  to  those 
reported  for  the  composites.  Thus,  the  two  methods  used  for 
summarizing  the  data  for  regression  analysis  may  reflect  a  range 
of  possible  R-square  values  which  depends  on  which  APT  tests  are 
used,  but  they  also  demonstrate  underlying  trends  among  PC,  RT 
and  AFQT. 

Regression  analyses  using  the  selected  PC  and  RT  scores  were 
also  carried  out  for  the  T-TUT  criterion.  Tables  10  to  12 
present  the  results  of  these  analyses. 


Table 10. PC and RT tests correlating most highly with criteria.
Criterion: Total time spent on the tutor (T-TUT)

                     PC        RT        PC added   RT added
                                         to RT      to PC
Multiple R           .730      .570      .767       .767
R Square             .533      .325      .589       .589
R Square Change                          .264*      .056*
F Change             18.292    13.472    9.915      3.582
Adjusted R Square    .503      .301      .546       .546

Table 11. Validity of AFQT and incremental validity of PC and RT.
Criterion: Total time spent on the tutor (T-TUT)

                     AFQT      PC added   RT added
                               to AFQT    to AFQT
Multiple R           .680      .748       .722
R Square             .462      .559       .521
R Square Change                .098*      .059*
F Change             248.129   3.543      3.432
Adjusted R Square    .460      .530       [illegible]


Table 12. Incremental validity for AFQT, PC and RT.
Criterion: Total time spent on the tutor (T-TUT)

                     AFQT added   PC added      RT added
                     to APT       to RT, AFQT   to PC, AFQT
Multiple R           .777         .777          .777
R Square             .603         .603          .603
R Square Change      .015*        .082*         .044*
F Change             9.728        3.20          2.891
Adjusted R Square    .561         .561          .561

The  improvements  are  similar  to  those  for  POSTACC.  The 
tests  showed  an  increase  in  R-square  for  T-TUT,  usually  of  4%  or 
less,  when  the  selection  method  was  used  instead  of  composites. 
Thus,  the  same  general  trends  were  observed  for  the  selection  and 
composite  methods  for  T-TUT.  PC  had  the  largest  R-square, 
followed  by  AFQT  and  RT.  Both  PC  and  RT  added  variance  over 
other  scores,  and  AFQT  added  a  small  amount  of  unique  variance 
over  PC  and  RT  combined  (1.5%,  p<.01). 


STUDY  2 

METHODOLOGY 

In Study 2, 402 subjects completed the ASVAB, a battery of
CAM  tests,  and  a  computerized  tutor  designed  to  teach  flight 
engineering  knowledge  and  skills.  As  in  the  first  study, 
subjects  were  temporary  service  employees.  Approximately  25% 
were  women,  35%  were  White,  47%  were  Hispanic  and  14%  were  Black. 
Regression  analyses  were  conducted,  using  the  amount  of 
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information  learned  from  the  Flight  Engineering  tutor  as  the 
dependent variable. The criterion, POSTFAC, was a measure of the
percent  correct  on  a  test  given  following  the  tutor. 


RESULTS 

Tables  13  to  15  summarize  the  results  of  regression  analyses 
using  composites  of  PC  and  RT  scores.  The  same  tests  used  for 
the  composites  in  Study  1  were  used  for  these  analyses. 


Table 13. Validity, incremental validity of PC and RT composites.
CRITERION: POSTFAC (Accuracy on test of flight engineering)

                     PC        RT        PC added   RT added
                                         to RT      to PC
Multiple R           .870      .561      .880       .880
R Square             .757      .315      .774       .774
R Square Change                          .459*      .017*
F Change             147.871   21.813    94.492     3.521
Adjusted R Square    .752      .300      .764       .764


Table 14. Validity of AFQT and incremental validity of PC and RT.
CRITERION: POSTFAC (Accuracy on test of flight engineering)

                     AFQT      PC added   RT added
                               to AFQT    to AFQT
Multiple R           .849      .905       .868
R Square             .720      .818       .753
R Square Change                .098*      .033*
F Change             746.110   25.563     6.284
Adjusted R Square    .719      .814       .747

Table 15. Incremental validity for AFQT, PC and RT.

                     AFQT added   PC added      RT added
                     to PC, RT    to RT, AFQT   to PC, AFQT
Multiple R           .913         .913          .913
R Square             .834         .834          .834
R Square Change      .060*        .081*         .016*
F Change             100.613      22.653        4.408
Adjusted R Square    .826         .826          .826
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In  this  analysis,  the  AFQT  and  APT  tests  combined  account 
for  more  variance  than  they  did  in  Study  1  (83.4%  of  POSTFAC  in 
Study  2,  versus  58.7%  of  Accuracy  and  56.6%  of  Total  Time  in 
Study 1). This may be due to the very high Multiple R between
AFQT and the Study 1 criteria (e.g., .73 between AFQT and
POSTACC). On the other hand, the unique variance accounted for
by  PC  and  RT  over  AFQT  is  nearly  twice  the  amount  found  in 
Study  1. 

Although  there  is  a  difference  in  the  magnitude  of  the  R- 
squares,  the  incremental  R-square  results  indicate  similar  trends 
to  those  observed  for  the  composites  in  Study  1.  In  both  cases, 
PC  has  the  highest  R-square,  followed  by  AFQT  and  RT.  Both  PC 
and  RT  show  incremental  validity  over  each  other  and  over  AFQT, 
as  in  the  analysis  with  the  total  time  criterion  in  the  first 
study.  The  variance  added  by  RT  over  PC  and  AFQT  combined  is 
small  but  significant  (1.6%,  p<.01).  PC  adds  unique  variance 
over RT and AFQT combined (approximately 8%), and AFQT adds
unique variance over PC and RT combined (6%).

As  in  the  first  study,  analyses  were  performed  using  a 
subset  of  the  tests.  Tables  16  to  18  present  the  results  of 
regression  analyses  using  only  the  tests  that  were  moderately  or 
highly correlated with the criterion, POSTFAC.

Table 16. PC and RT tests correlating most highly with criteria.
CRITERION: POSTFAC (Accuracy on test of flight engineering)

                     PC        RT        PC added to RT   RT added to PC
Multiple R           .876      .536      .878             .878
R Square             .767      .287      .770             .770
R Square Change      .767      .298      .483             .003
F Change             49.846    14.247    30.941*          .497
Adjusted R Square    .751                .748             .748

Table 17. Validity of AFQT and incremental validity of PC and RT.
CRITERION: POSTFAC (Accuracy on test of flight engineering)

                     AFQT       PC added to AFQT   RT added to AFQT
Multiple R           .849       .909               .864
R Square             .720       .826               .747
R Square Change      .720       .106*              .027*
F Change             746.110    9.196              3.735
Adjusted R Square    .719       .814               .739


Table 18. Incremental validity for AFQT, PC and RT.
CRITERION: POSTFAC (Accuracy on test of flight engineering)

                     AFQT added to   PC added to   RT added to
                     PC, RT          RT, AFQT      PC, AFQT
Multiple R           .913            .913          .913
R Square             .834            .834          .834
R Square Change      .064*           .087*         .008
F Change             100.867         7.653         1.528
Adjusted R Square    .817            .817          .817

Selecting tests for the regression analysis produces essentially
the same results as using composites. As in the analyses in Study 1,
selecting the best tests did not substantially raise R-squares in
this study. In fact, the one notable difference is that RT drops
from an R-square of .315 when composites are used to one of .287
when the "best" RT scores are used, and the unique variance
accounted for by RT over PC is no longer significant. Thus,
selecting tests correlated highly with the criterion is not
always effective in increasing R-square. In general, however, it
makes little difference whether composites or a select group of
tests are used in the regression equation.

DISCUSSION  AND  CONCLUSION 

The  results  of  both  studies  indicate  that  scores  based  on 
percent  correct  and  reaction  time  are  useful  for  predicting 
learning  and  performance  on  the  complex  computerized  tutors. 
Furthermore,  these  variables  add  unique  variance  beyond  that 
contributed by a standardized measure of general aptitude (AFQT).
In  both  studies,  the  regression  analyses  indicate  that  Percent 
Correct  accounts  for  slightly  more  variance  in  criterion  scores 
than  that  accounted  for  by  AFQT.  The  analyses  also  show  that  the 
new  measures  have  incremental  validity  beyond  the  AFQT  in 
predicting  performance  on  the  tutor.  Response  latency  scores  add 
unique  variance,  but  scores  reflecting  error  rates  are  the  most 
useful  predictors,  whether  they  are  used  alone  or  in  addition  to 
other  variables. 

One  other  study  has  compared  the  validities  of  a  battery  of 
experimental  cognitive  tasks  and  the  ASVAB  tests  in  predicting 
performance  on  an  Air  Force  tutor.  Christal  (1991)  conducted  a 
study  using  military  recruits,  and  predicted  performance  on  a 
computerized  tutor  designed  to  teach  knowledge  and  skills  on 
logic  gates.  His  study  used  a  different  criterion  and  a 
different set of computerized tests. He found that the
computerized measures added as much as 22% unique variance over
the ASVAB scores. The present study found not more than 10%
unique variance of APT over AFQT, but there was less variance
remaining after AFQT had been entered into the regression than
there was in Christal's study after the ASVAB had been entered.
The correlations between the standardized aptitude measures and
the measures from the computer battery were much lower in
Christal's study (e.g., .48, vs. .85 in the present study; see
Table 8). Thus, it is especially noteworthy that incremental
validities were obtained for the percentage correct scores, in
spite of the high validities for the AFQT tests.

In  sum,  two  essential  points  merit  repeating: 

1)  The  analyses  indicated  that  percentage  correct  scores  on 
the  cognitive  test  battery  had  higher  predictive  validities  than 
the  standardized  aptitude  measures  for  the  tutor  criteria. 

2)  Incremental  validities  were  obtained  for  the  computerized 
cognitive  information  processing  scores,  over  and  above  the 
validities  of  the  standardized  measures.  Note,  however,  that 
AFQT  scores  also  added  incremental  validity  over  the  computerized 
measures,  particularly  for  the  POSTFAC  criterion. 

The  results  of  this  research  are  encouraging,  but  a  few 
caveats  concerning  limitations  are  warranted.  One  limitation  is 
the subject population, which was different in make-up from a
typical population of Air Force applicants. The subject
population contained a higher proportion of minorities than would
usually be found in an Air Force sample. In addition, the
question of motivation arises, as it would in other studies using
non-applicant  samples.  Subjects  were  encouraged  to  perform  to 
the  best  of  their  ability,  but  the  degree  to  which  their  level  of 
motivation  would  have  differed  from  that  of  an  applicant  sample 
was  uncertain.  Further  empirical  research  investigating  the 
nature  and  measurement  of  subjects'  "motivational  states"  is 
necessary (Arvey, Strickland, Drauden, & Martin, 1990). A third
issue  is  restriction  of  range.  One  might  argue  that  the  variance 
of  the  data  sample  should  have  been  corrected  to  that  of  the 
general  population.  However,  since  Air  Force  applicants  are  a 
select  group  with  above-average  ASVAB  scores,  and  the  sample  used 
in  the  study  represented  a  reasonable  match  to  the  population  for 
which  the  tests  were  constructed,  this  procedure  was  not  carried 
out. As stated in the Principles for the Validation and Use of
Personnel Selection Procedures (1987), validation should use a
sample that is "reasonably representative of the populations of
people and jobs to which the results are to be generalized"
(p. 7).

The  future  holds  exciting  prospects  for  the  development  of 
new  computerized  tests  and  their  use  for  the  understanding  of 
cognitive  information  processing  and  intelligence.  Further 
studies  will  no  doubt  bring  novel  methods  for  measuring  and 
understanding  human  cognition. 




References 


Arvey, R.D., Strickland, W., Drauden, G., & Martin, C. (1990).
Motivational components of test-taking. Personnel Psychology,
43, 695-716.

Campbell, J.P. (1990). An overview of the Army selection and
classification project (Project A). Personnel Psychology, 43,
231-239.

Christal, R.E. (1991). Comparative validities of ASVAB and LAMP
tests for logic gates learning (AL-TP-1991-0031). Brooks Air
Force Base, TX: Manpower and Personnel Division, Air Force
Human Resources Laboratory.

Fleishman, E.A. (1988). Some new frontiers in personnel selection
research. Personnel Psychology, 41, 679-701.

Hunt, E. (1982). Toward new ways of assessing intelligence.
Intelligence, 6, 231-240.

Kyllonen, P.C., & Christal, R.E. (1989). In R.F. Dillon & J.W.
Pellegrino (Eds.), Testing: Theoretical and applied
perspectives (pp. 146-173). New York: Praeger.

Kyllonen, P.C., Woltz, D.J., Christal, R.E., Tirre, W.C., Shute,
V.J., & Chaiken, S. (1990). CAM-4: Computerized battery of
cognitive ability tests. Unpublished computer program. Brooks
Air Force Base, Texas.

Society for Industrial and Organizational Psychology, Inc. (1987).
Principles for the validation and use of personnel selection
procedures: Third edition. College Park, MD.

Shute, V.J. (1992). A comparison of learning environments: All
that glitters... (AL-TP-1991-0042). Brooks Air Force Base, TX:
Manpower and Personnel Division, Air Force Human Resources
Laboratory.

Sternberg, R.J. (1986). The future of intelligence testing.
Educational Measurement: Issues and Practice, 5, 19-22.




Authors' Note:


The Learning Abilities Measurement Program (LAMP) research
group includes Patrick C. Kyllonen, Raymond Christal (from
Universal Energy Systems, Inc.), Scott Chaiken, Lisa Gawlick,
Carmen Pena (now at the University of Iowa), Valerie Shute, Bill
Tirre and Dan Woltz, as well as the OAO programming staff, Henry
Clark, Trace Cribbs, Rich Walker, Cindi Garcia, Jo Ann Hall,
Janice Hereford, Terri Purdue, and Karen Raouf.

The authors are especially grateful to Valerie Shute for
conducting a larger study from which the data reported here were
borrowed.




INORGANIC FIBER ANALYSIS
BY SEM-EDXA


Robert F. Diskin
Department of Chemistry
University of Scranton
Scranton, Pennsylvania 18520-4626


Final Report for:
Summer Research Program
Armstrong Laboratory


Sponsored  by: 

Air  Force  Office  of  Scientific  Research 
Armstrong  Laboratories  OEHL 
Brooks  Air  Force  Base,  San  Antonio  Texas  78235 


September  1992 




INORGANIC FIBER ANALYSIS
BY SEM-EDXA


Robert F. Diskin
University  of  Scranton 
Scranton,  Pennsylvania  18520-4626 

Abstract 

The  air  asbestos  analysis  laboratory  at  Brooks  Air  Force  Base  often 
needs  to  identify  fibrous  material  that  may  or  may  not  be  asbestos.  In  order 
to help improve the laboratory's ability to identify such fibers, the author
prepared and analyzed a number of inorganic fiber standards and several air
filters submitted to Brooks with an Amray 1820 Scanning Electron Microscope
and Tracor Northern X-ray Analyzer. All seven types of asbestos and twenty
non-asbestos  man-made  and  natural  fibers  were  analyzed  on  the  SEM.  The 
objective  of  the  project  was  to  supply  the  author's  professor  with  the 
necessary  data  to  design  a  system  for  fiber  identification  using  morphology 
and  the  elemental  analyses  obtained. 




Introduction 


Asbestos  is  a  well-documented  health  hazard.  Exposure  to  airborne  asbestos 
fibers  can  lead  to  diseases  such  as  asbestosis,  lung  cancer,  and  mesothelioma 
(1). It is therefore important that samples suspected to contain asbestos be
analyzed  for  these  fibers. 

The air asbestos analysis section of the Analytical Services Division of
the Occupational and Environmental Health Laboratory analyzes several thousand
filters annually for their fiber content according to NIOSH method 7400. This
method calls for using phase contrast microscopy to count all fibers with a
length-to-width (aspect) ratio of at least 3:1.

The NIOSH method 7400 receives much criticism because it calls for counting
fibers on air filters without distinguishing asbestos from non-asbestos
fibers. Furthermore, the NIOSH method is unable to distinguish organic from
inorganic fibers. Situations arise where this laboratory needs to identify the
fibers on the filters as asbestos or some other fibrous material as a service to
its constituency. Since the NIOSH method can exaggerate the
potential health hazard, the project objective was to assist the environmental
officer in making decisions for handling high fiber counts. One potential way of
providing this service is to supplement the fiber count analysis with scanning
electron microscopy using an SEM equipped with an x-ray analyzer. The
combination of an electron micrograph with an elemental composition as determined
by energy dispersive x-ray analysis can frequently identify fibers as asbestos
or non-asbestos, and often yield a reasonable answer as to the true identity of
the fiber. Since it is costly to analyze every fiber, the author's professor
designed a method for rapidly analyzing a representative fraction of the fibers
in air samples to help evaluate the environment where the sampling occurred (5).
The project outlined in this paper details the SEM-EDXA technical work that was
required for the professor's goal.

Instrumentation

The scanning electron microscope used in this study was an Amray Model 1820
equipped with a data entry keyboard and an air-cooled turbomolecular vacuum pump.
It was an automatic imaging SEM, and included digital image storage and
processing. It was capable of dot mapping of elemental locations. Based on
previous work, all fibers were processed at a tilt angle of 38.5 degrees in order
to produce the most intense x-ray pattern sensed by the x-ray detector (4). A
detector slide position of 45 mm and a working distance of 12 mm using 20 keV at
90 mA proved most advantageous in obtaining a useful EDXA spectrum in less than
two minutes of analysis time (2).

The x-ray analysis unit was a Tracor Northern Series II X-ray Analyzer
(TN-II) equipped with a standard beryllium window detector. The TN-II SQ x-ray data
acquisition and analyzer software program was used to acquire the x-ray spectrum.
The SQ software is capable of performing quantitative analysis on x-ray spectra
using a library of references stored on disk. The program uses Multiple Least
Squares Analysis and ZAF or Phi-Rho-Z matrix correction procedures to calculate
elemental concentration results. ZAF corrects for atomic number (Z), absorption
(A) and fluorescence (F) effects. It is a theoretical correction procedure based
on excitation conditions, geometry of the sample and intensity ratios constructed
from pure element samples. The package also makes non-linear background
corrections on a channel-by-channel basis and is dependent upon the
sample-to-detector geometry (6).

An Anatech LTD Hummer VI Sputtering System was used to metal coat the
specimens before introduction into the scanning electron microscope. A
gold-palladium alloy was used as the coating metal. The coating was about 85% gold by
weight.

Methodology 

The standard inorganic fiber samples were cut into pieces with a pair of
scissors. The pieces were placed on a carbon-painted aluminum stud. The studs
were painted twice and, before the second coat was dry, the fibers were applied,
which allowed sufficient adherence of the fibers to the studs. The studs were
placed in the coating chamber of the Anatech LTD Hummer VI and the system was
evacuated to at least 25 millitorr. Argon was introduced into the system to a
pressure of 80 millitorr. The high voltage was turned on such that a 10 milliamp
current flow was established. A discharge was created which sputtered the Au/Pd
alloy onto the specimens. Coating was allowed to continue until a layer 10
nanometers thick was deposited.

The samples were placed in the vacuum chamber of the Amray 1820 scanning
electron microscope and searched for suitable fibers. The selected fibers were
photographed at a magnification which illustrated their morphology. After the
visual inspection, the magnification was increased until the SEM partial field
(3 cm x 2 cm display on the video monitor of the SEM) fitted entirely within the
fiber image, and the EDXA was obtained for the fiber. Under these experimental
conditions the background was minimized, but continuous monitoring and adjustments
were usually required during EDXA data acquisition to compensate for drift of the
particle field, especially at high magnification.

A portion of the selected fiber was analyzed using the Tracor Northern
X-ray Analyzer. At least six energy dispersive x-ray analyses were done for each
standard. For each standard, usually three fibers received one analysis
while one fiber received three analyses at different locations along the fiber.
Analysis was done over the range 0-10 keV for 90 seconds. The quantitative analysis
program SQ of the Tracor Northern system was activated to yield a quantitative
analysis of those elements selected. Elemental selection was dependent on the
known composition of each standard. The computer gave the data as either atomic
percentage or weight percentage. All analysis printouts, the photographs and a
printout of the spectra obtained were filed in a binder and used by the author's
professor for his research.
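
Weight percentage and atomic percentage describe the same composition in different units, so either can be derived from the other using the atomic masses of the elements. The sketch below illustrates the conversion in Python. It is not the SQ program's own routine (its internals are not described here); the function name and the example composition are hypothetical, and the atomic masses are standard values rounded to three decimals.

```python
# Standard atomic masses (g/mol), rounded; covers common fiber elements.
ATOMIC_MASS = {
    "Na": 22.990, "Mg": 24.305, "Al": 26.982, "Si": 28.086,
    "P": 30.974, "S": 32.06, "Cl": 35.45, "K": 39.098,
    "Ca": 40.078, "Mn": 54.938, "Fe": 55.845,
}


def weight_to_atomic_percent(weight_pct):
    """Convert {element: weight %} to {element: atomic %}.

    Each weight fraction is divided by the element's atomic mass to get
    relative mole amounts, which are then renormalized to sum to 100.
    """
    moles = {el: w / ATOMIC_MASS[el] for el, w in weight_pct.items()}
    total = sum(moles.values())
    if total <= 0:
        raise ValueError("composition must contain a positive amount")
    return {el: 100.0 * m / total for el, m in moles.items()}


if __name__ == "__main__":
    # HYPOTHETICAL composition, for illustration only (not measured data).
    example = {"Mg": 30.0, "Si": 50.0, "Fe": 20.0}
    for element, pct in weight_to_atomic_percent(example).items():
        print(f"{element}: {pct:.1f} atomic %")
```

Heavy elements such as Fe contribute less to the atomic percentage than to the weight percentage, which matters when comparing a measured spectrum with the ideal formula of a mineral.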

For the air samples, a 4 mm triangle was cut from each unknown cellulose
acetate air filter submitted to the Armstrong Laboratory. The 4 mm sample was
mounted on an aluminum stud covered with wet carbon paint. When dry, the sample
was coated with an 11 nm Au/Pd coating, then placed in the SEM vacuum chamber.
Suitable fibers, those with correct morphology (length-to-width ratio of at
least 3:1), were analyzed for elemental composition. Since there was no
prior knowledge of the fiber composition, eleven elements were assayed
(Si, Mg, Fe, Ca, Mn, Na, S, P, Cl, K and Al). A minimum of five fibers were
analyzed for each air filter, and selection of fibers was as random as possible,
meaning no fiber analyzed was in close proximity to any other fiber analyzed.

Results  and  Discussion 

The project was composed of two phases: the collection and analysis of
known inorganic fiber standards, followed by the analysis of unknown fibers on
several air samples submitted to the laboratory. All seven types of asbestos and
twenty non-asbestos man-made and natural fibers were analyzed on the SEM (see
Table 1). Because this laboratory primarily analyzes samples for asbestos,
non-asbestos fibers that are frequently mistaken for asbestos because of their
morphology were selected.

As noted earlier, all printouts and photographs were assembled into a small
library and submitted to the author's professor. Analyses of air filters are best
obtained at around 2000x. At this magnification, fibrous talc and other
crystalline materials which commonly yield EDXA data comparable to that obtained
for asbestos could be distinguished from asbestos and asbestos-like inorganic
fibers (2).

Not infrequently, some peak identifications given by the x-ray analysis
system were faulty. The most common misidentifications were Nb instead of Au, Tl
instead of Cl, Rb and Au instead of Si, and Cl instead of Pd when Cl was also
present (4).

A  scanning  electron  microscope  equipped  with  an  energy  dispersive  x-ray 
analyzer  presents  a  practical  method  of  identifying  inorganic  fibrous  material 
commonly  found  in  atmospheric  environments.  When  using  this  equipment,  a  number 
of  procedures  need  be  followed  in  order  to  assure  the  greatest  accuracy. 
Appropriate  magnification  of  a  fiber  is  that  which  reveals  adequate  morphology 
while  being  sufficient  to  completely  cover  the  partial  field  on  the  SEM  video 
monitor.  Several  EDXA  spectra  should  be  obtained  in  order  to  establish  the 
elemental  composition  of  the  standards  and  fiber  makeup  of  air  samples  with 
reasonable accuracy.

TABLE 1

NON-ASBESTOS FIBERS                          ASBESTOS FIBERS

Alumina Saffil       Fiberglass 0-C          Amosite
Alumina              Graphite P-55           Anthophyllite
Bulk 6000            Graphite T300           Crocidolite
Ca Na Phosphate      Kaowool                 Chrysotile
Calcium Silicate     Metallized Plastic      Tremolite
Calcium Sulfate      Mineral Wool A          Actinolite
Cerawool             Rockwool A              Ferroactinolite
Cooperknit           Sandtex
Fiberfrax            Talc 11
Fiberfrax 7000       Tritan Kaowool


COMPONENTS OF SPATIAL AWARENESS:

EFFECTS OF AIR FORCE FIGHTER PILOT TRAINING AND EXPERIENCE


Itiel E. Dror

Department  of  Psychology 
Harvard  University 


Abstract 


Air Force pilots with varying experience and training participated in
two experiments that assessed different components involved in spatial
awareness. One experiment examined the ability to track the motion of
multiple moving objects, and the other experiment examined the ability to
extrapolate motion. We found that motion tracking skills were affected by a
"recency of flying" effect: pilots who had recently flown were more accurate than
pilots who had not. No such effect was found for motion extrapolation skills.
We also found that although experienced fighter pilots and novice non-fighter
pilots had comparable performance on both tasks in the easiest
conditions, the experienced fighter pilots were better at both tasks in the most
difficult conditions. However, novice pilots were better at the intermediate
levels.

COMPONENTS OF SPATIAL AWARENESS:

EFFECTS OF AIR FORCE FIGHTER PILOT TRAINING AND EXPERIENCE

Itiel E. Dror


Pilots  rely  on  spatial  awareness  for  the  successful  accomplishment  of 
missions  as  well  as  for  safety.  The  importance  of  spatial  awareness  is  further 
emphasized  in  the  tactical  aviation  environment  in  which  highly  agile 
fighter  aircraft  are  designed  to  operate  across  the  spectrum  of  flight  altitudes 
and  orientations.  Spatial  awareness  encompasses  being  aware  of  one's  own 
spatial  position  along  with  the  spatial  relations  to  various  elements  in  the 
surrounding environment. The high speed and maneuverability of fighter
aircraft result in constant and rapid changes in the spatial positioning of the
aircraft. The spatial relations to stationary ground targets as well as to other
rapidly moving aircraft constantly change. Consequently, pilots must always
assess and update their situation awareness. This is especially demanding
when  pilots  attend  to  cockpit  displays  and  are  thus  disconnected  from  direct 
visual  contact  to  their  surroundings.  Failure  to  maintain  spatial  awareness 
can  have  disastrous  consequences  as  evidenced  by  the  large  number  of 
accidents  which  have  been  attributed  to  loss  of  situation  awareness  and 
spatial  disorientation.  Furthermore,  maintaining  spatial  awareness  during  air 
combat  is  considered  to  be  an  essential  element  of  success. 

The goal of this study is to understand and characterize some of the
subsystems involved in spatial awareness and the factors that influence them.
The study also examines whether spatial awareness abilities are affected by
Air Force fighter training and experience, by a "recency of flying" effect
(whether or not the pilots have recently flown), or by some combination of
these factors.

Two tasks were designed to tap into the underpinnings of motion
extrapolation and motion tracking. In each task we manipulated the number
of moving objects and the duration of the tracking and extrapolation processes. If
these manipulations affect the underlying processes, then they should force
more processing and thus affect accuracy rates and response times. The
increase in response time and error rate as a function of the manipulations can
be calculated as a slope. By comparing such slopes one can examine the
specific components per se, independent of the processes involved in
encoding the stimulus and generating the response itself (which are reflected
by the intercept of the function). This method allowed us to determine the
relative contributions of specific types of processing (Sternberg, 1969).
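
The slope and intercept described above can be obtained with an ordinary least-squares line fitted to mean response time (or error rate) as a function of the manipulated variable. The following Python sketch shows the calculation. The function name and the response times in the example are hypothetical and are not data from this study.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = intercept + slope * x.

    The slope estimates the added cost per unit of the manipulation
    (e.g., per additional object); the intercept estimates the part of
    the response that does not depend on the manipulation.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two paired observations")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("x values must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


if __name__ == "__main__":
    # HYPOTHETICAL mean response times (ms) for 1, 2 and 3 objects.
    n_objects = [1, 2, 3]
    mean_rt_ms = [600.0, 700.0, 800.0]
    slope, intercept = fit_line(n_objects, mean_rt_ms)
    print(f"slope = {slope:.1f} ms per object, intercept = {intercept:.1f} ms")
```

Two groups can then be compared on their slopes even when their intercepts differ, which is the sense in which the method isolates the component of interest from encoding and response processes.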

This  study  is  a  continuation  of  a  previous  study  where  pilots  were 
found  to  be  better  than  control  non-pilots  at  certain  components  of  visual 
cognition  but  not  at  others  (for  a  detailed  account  of  the  study  see  Dror,  1991; 
Dror,  1992;  and  Dror,  Kosslyn,  and  Waag,  1992). 

GENERAL  METHOD 

The  subjects  were  tested  individually  on  each  task  in  one  testing 
session,  which  lasted  on  the  average  forty-five  minutes.  Half  the  subjects 
were  first  tested  on  the  motion  tracking  task  and  then  on  the  motion 
extrapolation  task;  the  other  half  of  the  subjects  were  tested  in  the  reverse 
order.  The  subjects  were  given  verbal  instructions  and  then  were  asked  to 
paraphrase  them,  and  any  misconceptions  were  corrected.  The  subjects  began 
each  task  with  a  set  of  practice  trials.  During  the  practice  trials,  the  computer 
gave  feedback  by  beeping  when  the  subject  made  an  incorrect  response,  and 
the  subjects  were  encouraged  to  ask  questions.  During  the  test  trials,  no 
feedback  was  provided  and  no  talking  was  allowed.  The  tasks  required  the 
subjects  to  respond  by  pressing  keys  marked  "yes"  (the  "b"  key)  and  "no"  (the 
"n"  key)  on  the  computer's  keyboard.  The  subjects  used  two  fingers  of  their 
dominant hand to press the keys. The tasks were administered on a
Macintosh IIci computer with a high resolution video display card (8.24
video card). The computer was connected to a color 13 inch multiscan
Trinitron super fine pitch Sony monitor. The tasks were administered by a
computer program that used the Shell and Macglib libraries of Micro M.L. Inc.
All  subjects  sat  so  that  their  heads  were  approximately  50  centimeters  from 
the  computer  screen.  The  subjects  were  asked  to  respond  as  quickly  as  possible 
while  remaining  as  accurate  as  possible. 

Thirty-four pilots were tested on both experiments. The pilots formed
two distinct groups: the first group of 20 pilots were very experienced and
highly trained fighter pilots; the second group of 14 pilots were novice pilots
who had never flown fighter aircraft. The mean flying hours on fighter
aircraft (F-15 and F-16) of the first group was 1650 (range 1000-3100). Their
overall mean flying hours, including non-fighter aircraft, was 2300 (range
1400-3400). Their mean age was 34.8 (range 29-41). The second group of pilots
included pilots who had never flown a fighter aircraft but were assigned to be
fighter pilots. They all had recently finished the undergraduate training of the
Air Force and were waiting for their basic fighter aircraft training to begin.
The pilots in this group had approximately 300 flying hours on non-fighter
aircraft. Their mean age was 26.1 (range 24-29). All the pilots had completed at
least a college education. The pilots were recruited and tested at Luke Air Force
Base, AZ.

EXPERIMENT  1:  VISUAL  EXTRAPOLATION 

Visual  imagery  enables  us  to  manipulate  in  our  mind  the  visual 
stimuli  we  receive  from  the  outside  world.  We  can  manipulate  and 
transform  images  in  numerous  ways.  One  such  manipulation  encompasses 
shifting  and  changing  the  spatial  position  of  objects  within  a  complex  image 
(e.g.,  one  can  imagine  rearranging  items  on  the  desk  —shifting  the  phone  and 
the  computer  to  different  locations).  Indeed,  such  manipulations  are  often 
used  in  reasoning  (Hayes,  1981).  Motion  can  be  accomplished  in  mental 
imagery  by  constantly  shifting  the  spatial  positions  of  items  within  the  image. 
Motion  extrapolation  requires  such  changes,  whereby  the  imagined  motion  is 
based  on  previously  seen  motion.  Correct  timing  and  spatial  perception  are 
also  key  factors  that  enable  the  imagined  motion  to  be  in  the  same  speed  and 
trajectory  as  the  previously  seen  motion.  Thus,  motion  extrapolation  requires 
the use of imagery as well as perception of time and space. Pilots' ability to
extrapolate motion is particularly interesting, as previous studies show that
pilots have unique abilities in some processes of visual imagery (visual
mental rotation; Dror, 1992) but not in others (scanning visual images;
Dror, Kosslyn, & Waag, 1992).

We  used  a  visual  extrapolation  task  that  was  a  variant  of  one  devised  by 
Dror  (1991;  see  also  Dror,  Kosslyn,  and  Waag,  1992).  In  the  previous  study, 
subjects  were  required  to  extrapolate  the  motion  of  one  object  that  was 
moving  in  a  circular  trajectory.  The  study  revealed  that  the  components 
involved  in  motion  extrapolation  work  harder  when  the  trajectory  had  to  be 
projected a greater distance. In the present study subjects were required to
extrapolate the motion of multiple objects moving in various straight
trajectories. It was assumed that the task is also more demanding when the motion
of more objects has to be extrapolated. A group of moving balls was
presented, and the subjects attended to a subgroup of them that were flashing.
All balls were then removed from the display, and the screen remained
blank. After a delay the balls reappeared, as if they had been in constant
motion while the screen was blank. One ball was presented as a probe ball, and
the subjects were required to judge whether or not this ball was one of the
balls that had been flashing earlier.

Method 

Materials. The stimuli were round disks (the "balls") 18 pixels in
diameter (corresponding to 0.668 cm and 0.765 degrees of visual angle),
which moved in straight trajectories on the screen. The balls' initial
trajectories were random. The balls bounced off the walls of the screen but
moved through each other. Twelve black balls were presented in motion on
the screen while a subset of them (1, 2, or 3) flashed, changing color from
black to red and back every 60 ms. The speed of the motion was 4.33 cm per
second (corresponding to 4.96 degrees of visual angle per second). The motion
was created by displaying a new screen every 60 ms, with each ball advancing 7
pixels along its trajectory (corresponding to 0.260 cm and 0.298 degrees of
visual angle). The computer generated the new position for each of the balls
throughout the trial. While the screen was blank, the computer continued to
generate ball motion, but the balls were not displayed on the computer
screen.
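
The stimulus sizes and speeds reported above can be cross-checked with a short calculation. The sketch below is illustrative only: the viewing distance is not reported in the text, so the value of 50 cm is an assumption chosen because it reproduces the reported visual angles, and the function and constant names are hypothetical.

```python
import math

PIXEL_CM = 0.668 / 18        # cm per pixel, from the reported 18-pixel, 0.668 cm disk
VIEWING_DISTANCE_CM = 50.0   # ASSUMPTION: not reported; consistent with the reported angles
FRAME_S = 0.060              # a new screen was displayed every 60 ms


def visual_angle_deg(size_cm, distance_cm=VIEWING_DISTANCE_CM):
    """Visual angle (degrees) subtended by an extent of size_cm at distance_cm."""
    return math.degrees(2 * math.atan(size_cm / (2 * distance_cm)))


step_cm = 7 * PIXEL_CM            # distance covered per frame (7 pixels)
speed_cm_s = step_cm / FRAME_S    # speed in cm per second

print(round(visual_angle_deg(0.668), 3))       # ball diameter in degrees
print(round(step_cm, 3), round(visual_angle_deg(step_cm), 3))
print(round(speed_cm_s, 2), round(visual_angle_deg(speed_cm_s), 2))
```

Under the assumed viewing distance, the computed values agree with the figures given in the Materials paragraph to the reported precision.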

A total of 96 trials were prepared. The trials were constructed in 8 blocks
of 12 trials each. Half the trials in each block were "yes" trials
(presenting a probe ball that had flashed earlier) and the other
half were "no" trials (presenting a ball that had not flashed earlier). For each
of the 6 "yes" and 6 "no" trials in every block, half had a short time delay and
half a long time delay. We increased the short and long time delays used in
our previous study (Dror, 1991; see also Dror et al., 1992) and also increased
the time difference between them. The short time delay was now 2 seconds
and the long one 3.5 seconds. Of the 3 trials in every block that had the
same time delay and the same correct response, one required extrapolating
the motion of 1 ball, one of 2 balls, and one of 3 balls. Thus, each block had 12
trials that included all possible combinations of variables. The trials within
each block were randomly ordered with the constraint that the same number
of balls, time delay, or response could not appear more than three times in
succession. An additional 12 trials were prepared as practice trials. The block
of practice trials had the same structure as the testing blocks and included all
variable combinations. The performance of the subjects was examined based
on the manipulation of the time delay and the number of balls that were
extrapolated.
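
The block structure described above (2 responses x 2 delays x 3 set sizes, randomly ordered with no value repeating more than three times in succession) can be sketched as follows. This is a minimal illustration, not the original experiment software: the rejection-sampling approach and all names are assumptions.

```python
import itertools
import random

RESPONSES = ("yes", "no")
DELAYS_S = (2.0, 3.5)      # short and long delays, in seconds
N_BALLS = (1, 2, 3)        # number of flashing balls to extrapolate
MAX_RUN = 3                # no value may appear more than 3 times in a row


def longest_run(values):
    """Length of the longest run of identical consecutive values."""
    best = run = 1
    for prev, cur in zip(values, values[1:]):
        run = run + 1 if cur == prev else 1
        best = max(best, run)
    return best


def make_block(rng):
    """Return the 12 factorial trials in a random order that satisfies MAX_RUN."""
    trials = list(itertools.product(RESPONSES, DELAYS_S, N_BALLS))
    while True:  # reshuffle until the ordering constraint holds (assumed method)
        rng.shuffle(trials)
        if all(longest_run(column) <= MAX_RUN for column in zip(*trials)):
            return list(trials)


# Eight test blocks of 12 trials each, 96 trials in total.
blocks = [make_block(random.Random(seed)) for seed in range(8)]
```

Each block contains every combination of response, delay, and number of balls exactly once, so the constraint only affects the order of trials, not their composition.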

Procedure.  A  trial  began  with  an  exclamation  mark.  When  the  subject 
was  ready,  he  pressed  the  space  bar,  and  then  12  moving  balls  appeared  on  the 
computer  screen.  The  subject  was  told  to  track  the  motion  of  the  flashing 
balls,  and  to  keep  tracking  them  after  the  balls  disappeared  as  if  they  were  still 
moving  on  the  computer  screen.  When  the  balls  re-appeared,  one  of  the  12 
original  balls  was  flashing.  The  subject  was  to  respond  "yes"  if  the  flashing 
ball  was  one  of  the  balls  that  was  flashing  earlier,  and  "no"  if  it  was  not. 
Immediately  after  the  response,  another  exclamation  mark  appeared  and  a 
new  trial  began. 

Results 

The data were analyzed using analyses of variance. Separate analyses
were performed for response times and error rates. Outliers (responses
greater than 2.5 times the mean of the remaining scores in that cell)
made up 0.7% of the data and were replaced by the mean of the cell.
Incorrect responses were excluded from the computation of mean response times.
Each analysis included two between-subject variables: fighter/non-fighter
pilots and flew recently/did not fly recently (pilots who had more than 10
flying hours in the past 2 weeks and over 15 in the past 4 weeks were
considered pilots who had flown recently, and pilots who had not flown at all in the
past 2 weeks and had less than 4 hours in the past 4 weeks were considered
pilots who had not flown recently); and two within-subject variables: the
number of trajectories that were extrapolated (the number of balls) and the
duration of extrapolation.
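
The outlier rule described above can be illustrated with a small sketch. It assumes that "the mean of the cell" refers to the mean of the remaining scores in the cell and that each score is checked once against the original data; both points are interpretations, and the function name is hypothetical.

```python
from statistics import mean


def replace_outliers(cell_rts, factor=2.5):
    """Replace any score greater than `factor` times the mean of the
    remaining scores in the cell with that mean (assumed interpretation)."""
    scores = list(cell_rts)
    cleaned = list(scores)
    for i, rt in enumerate(scores):
        others = scores[:i] + scores[i + 1:]
        if others and rt > factor * mean(others):
            cleaned[i] = mean(others)
    return cleaned


# Illustrative response times (ms) for one cell; not data from the study.
print(replace_outliers([1200, 1300, 1250, 5000]))
```

In the illustrative cell, only the 5000 ms score exceeds 2.5 times the mean of the other three scores (1250 ms), so it alone is replaced.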

The  pilots  were  able  to  extrapolate  motion  with  approximately  75% 
accuracy.  They  were  even  able  to  extrapolate  the  motion  of  3  balls  at  the  same 
time with an accuracy level of approximately 64%. We were interested to see
which factors affected their performance, and whether or not these factors
interacted. We were especially interested to see if any of the within variables
(that tap into the extrapolation process per se) interacted with any of the
between variables (that represent different training and experience of pilots).
Such an interaction would show whether training and experience affect
components involved in motion extrapolation.

We found that the number of moving balls that were extrapolated
affected both response time and error rate, F(2,60) = 33.19, p < .01 for response
time (with means of 1223, 1450, and 1583 ms for 1, 2, and 3 balls, respectively),
and F(2,60) = 53.00, p < .01 for error rates (with means of 12.22, 27.38, and 36.48
percent error for 1, 2, and 3 balls, respectively). Indeed, linear contrasts
revealed that response times increased linearly with the number of
trajectories that were extrapolated, F(1,60) = 47.67, p < .01, as did error rates,
F(1,60) = 73.87, p < .01. The duration of the extrapolation did not affect
response time, F(1,30) = 1.01, p > .25, but did affect error rate, F(1,30) = 32.48, p <
.01 (with means of 21.26 and 29.47 percent error for short and long time
delays, respectively). It is important to realize that the two dependent
measures, response time and error rate, are inter-related (see Luce, 1986).
Thus, a manipulation can affect performance in response time, error
rate, or both.
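
As a worked illustration of the linear trend in the means reported above, the sketch below applies standard orthogonal polynomial contrast weights for three equally spaced levels to those means. It shows only the contrast estimates; the F statistics cannot be reproduced without the error terms, which are not reported. The helper name is hypothetical.

```python
LINEAR = (-1, 0, 1)       # linear trend weights for 3 equally spaced levels
QUADRATIC = (1, -2, 1)    # quadratic (departure from linearity) weights


def contrast(weights, means):
    """Weighted sum of condition means for a contrast."""
    return sum(w * m for w, m in zip(weights, means))


rt_means = (1223, 1450, 1583)        # ms, for 1, 2, and 3 balls (as reported)
error_means = (12.22, 27.38, 36.48)  # percent error (as reported)

print(contrast(LINEAR, rt_means), contrast(QUADRATIC, rt_means))
print(round(contrast(LINEAR, error_means), 2),
      round(contrast(QUADRATIC, error_means), 2))
```

The linear contrast estimate corresponds to the difference between the 3-ball and 1-ball means, while the quadratic estimate indicates how far the 2-ball mean departs from the midpoint of the other two.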

After establishing that the number of trajectories
that were extrapolated and the duration of the extrapolation did affect
performance, we proceeded to see if either of the between variables (being a
fighter pilot and recency of flying) made a difference in how the
manipulations affected performance. As illustrated in Figure 1, we found an
interaction in error rate between fighter pilot grouping (fighter pilots vs.
non-fighter pilots) and the number of trajectories that were extrapolated, F(2,60) =
3.66, p = .03. This interaction reflected that even though both groups of pilots
had similar error rates in the easiest condition, when only one trajectory was
extrapolated, the fighter pilots were more accurate in the hardest condition,
when three trajectories had to be extrapolated at once. However, the
non-fighter pilots were more accurate in the intermediate condition, when
two trajectories were extrapolated. No other interactions were observed, all
p > .2.

Fighter pilots and non-fighter pilots had comparable overall
performance, F < 1 for both response time and error rate. Finally, we found
that pilots who had flown recently and pilots who had not flown recently had
comparable performance as well, F < 1 for both response times and
error rates.


Insert  Figure  1  About  Here 


Discussion 

Replicating the finding of Dror, Kosslyn, and Waag (1991), we found that
the longer time delays affected performance. We also found that
increasing the number of trajectories that were extrapolated caused linear
increases in response time and error rate. Air Force fighter pilot training and
experience affected the pilots' ability to extrapolate motion; however, this effect
was limited to the number of trajectories that were extrapolated and did not
extend to the ability to extrapolate over different time durations.

Although  fighter  pilots  had  superior  performance  in  the  most  difficult 
conditions,  their  performance  was  not  better  across  all  conditions. 

EXPERIMENT  2:  MOTION  TRACKING 

Tracking a single ball that is moving at high speed is not an easy task.
Professional baseball batters need to track a ball moving at speeds of up to 100
mph, producing angular velocities greater than 500 degrees per second.
Indeed, professional baseball batters have superior skills at tracking such a
high-speed moving ball (Bahill and LaRitz, 1984). Tracking multiple objects is
even more demanding, as the attention system needs to cope with numerous
objects moving in different directions at the same time. Several theories have
been suggested on how the attention system accomplishes such a task (for a
good review see Yantis, 1992). We set out to test pilots on a demanding task
that required them to track up to 6 moving balls at the same time.

The motion tracking task was a variant of one used by Intriligator,
Nakayama, & Cavanagh (1991), which was based on a task devised by
Pylyshyn & Storm (1988). A group of moving balls was presented on the
computer screen. The subject was asked to track a flashing subset of the balls,
and to continue to do so after they stopped flashing (and thus became
indistinguishable from the other balls on the screen). The balls continued to
move on the computer screen after the flashing stopped; after a time
delay one of the balls was designated as a probe ball, and the subject had to
judge whether or not it was one of the balls that had flashed earlier.


Method 

Materials. The same materials used in the extrapolation task were used
here, except that 4, 5, or 6 balls flashed and the balls did not disappear
after the flashing was concluded. The balls remained on the screen and the
pilots had to track the balls that had flashed earlier. To force actual tracking and
not enable the pilots to rely on extrapolation, the trajectory of each ball was
changed so the motion was random: every 60 ms the balls changed trajectory
by a random shift of up to 30 degrees. To avoid confusion between
colliding balls, the balls now bounced off each other when they collided.
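
The random motion described above can be sketched as a per-frame update. This is an illustration under stated assumptions rather than the original stimulus code: the screen dimensions are hypothetical, the shift is assumed to be uniformly distributed in either direction, the ball radius is ignored at the walls, and collisions between balls are not modeled.

```python
import math
import random

STEP_PX = 7                      # pixels advanced per 60 ms frame (as reported)
MAX_TURN_DEG = 30                # maximum random change of heading per frame
WIDTH_PX, HEIGHT_PX = 640, 480   # ASSUMPTION: screen size is not reported


def advance(x, y, heading_deg, rng):
    """Move one frame: perturb the heading, step forward, reflect off walls."""
    heading_deg += rng.uniform(-MAX_TURN_DEG, MAX_TURN_DEG)
    x += STEP_PX * math.cos(math.radians(heading_deg))
    y += STEP_PX * math.sin(math.radians(heading_deg))
    if x < 0 or x > WIDTH_PX:            # left or right wall
        x = -x if x < 0 else 2 * WIDTH_PX - x
        heading_deg = 180 - heading_deg
    if y < 0 or y > HEIGHT_PX:           # top or bottom wall
        y = -y if y < 0 else 2 * HEIGHT_PX - y
        heading_deg = -heading_deg
    return x, y, heading_deg % 360


rng = random.Random(0)
x, y, heading = 320.0, 240.0, 0.0
for _ in range(5):                       # five frames = 300 ms of motion
    x, y, heading = advance(x, y, heading, rng)
```

Because the heading changes on every frame, the future position of a ball cannot be predicted from its earlier trajectory, which is the property the manipulation was intended to produce.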

Procedure. The same procedure used in the extrapolation task was used
here.

Results 

The data were analyzed as in Experiment 1; 0.6% of the data were
considered outliers. The pilots were able to track the motion of multiple objects
with approximately 78% accuracy. Again, we were interested in examining
which factors affected their performance. The number of balls affected both
response time, F(2,60) = 5.40, p < .01 (with mean response times of 1164, 1213,
and 1220 ms for 4, 5, and 6 balls, respectively), and error rates, F(2,60) = 10.06,
p < .01 (with mean error rates of 18.67, 20.00, and 27.94% for 4, 5, and 6 balls,
respectively). Indeed, linear contrasts showed that response time and error
rate increased linearly when the motion of more balls had to be tracked,
F(1,60) = 4.97, p = .03 for response time, and F(1,60) = 13.79, p < .01 for error rates.
The duration of the tracking did not affect response time, F(1,30) = 1.03, p > .2,
but did affect accuracy, F(1,30) = 8.29, p < .01 (with mean error rates of 21.26 and
26.47% for short and long tracking durations, respectively).

Most interestingly, we found that recency of flying had an overall effect
on accuracy, F(1,30) = 3.94, p = .05 (with mean error rates of 26.97 and 22.57% for
pilots who had not flown recently and pilots who had, respectively). No such
differences were found in response time, F < 1. No overall differences were
found between fighter and non-fighter pilots, F < 1 for both response time and
error rate.

We observed an interaction in error rate between the number of balls
tracked and the pilot grouping (fighter vs. non-fighter pilots), F(2,60) = 4.01, p =
.02. As illustrated in Figure 2, this interaction reflected that even though both
pilot groups had similar error rates in the easiest condition, when only 4 balls
were tracked, the fighter pilots were more accurate in the most difficult
condition, when 6 balls were tracked at once. However, the non-fighter pilots
were more accurate in the intermediate condition, when five balls were
tracked.


Insert  Figure  2  About  Here 


We also found a trend for a four-way interaction in error rates, F(2,60) =
2.90, p = .06, for the interaction between the number of balls tracked, the
duration of the tracking, the pilot grouping (fighter vs. non-fighter pilots),
and recency of flying. As reflected by linear-by-linear contrasts, and
illustrated in Figure 3 (top left), looking only at the pilots who had flown
recently, the fighter pilots were much better at the most difficult condition
relative to the intermediate condition, F(1,60) = 7.95, p < .01; this is consistent
with the interaction we found between pilot grouping and number of balls.
However, looking only at the pilots who had not flown
recently, as illustrated in Figure 3 (top right), the error rate was especially high
for fighter pilots in the intermediate condition and for non-fighter pilots in
the easiest condition. Further evidence for this trend was revealed by a
linear-by-linear contrast, illustrated in Figure 3 (bottom left), showing that for
the easiest condition the non-fighter pilots who had not flown recently were
much more affected by the duration of the tracking process than those who
had flown recently, F(1,60) = 5.47, p = .02; in contrast, no such effect was
observed in the fighter pilot group, F(1,60) = 1.66, p > .2. However, in the
intermediate condition, illustrated in Figure 3 (bottom center), the fighter
pilots showed such an effect; fighter pilots who had not flown recently were
much more affected by the duration of the tracking process than the ones who
had flown, F(1,60) = 13.89, p < .01 for the linear-by-linear contrast. No such
effect was observed for the non-fighter pilots, F < 1 for the corresponding
linear-by-linear contrast. For the most difficult condition, as illustrated in
Figure 3 (bottom right), no such differences were observed, F < 1 in both cases.


Insert Figure 3 About Here


The most interesting result pertains to the "recency flying effect"
observed in the motion tracking task; this is especially intriguing, given that
we did not find such an effect in the motion extrapolation task. Otherwise,
the general pattern of results was very similar; we found an interaction in
error rate between the number of balls and the pilot grouping (fighter vs.
non-fighter). In both experiments we found that both groups had comparable
error rates in the easiest condition, but the fighter pilots were better in the most
difficult condition. However, the non-fighter pilots were better in the
intermediate condition.

A hint of a possible explanation for this observation might come from
the trend for the four-way interaction that was found in this task. The pilots
who had not flown recently (fighter and non-fighter) showed declining
ability in tracking multiple objects. Pilots' enhanced ability to track objects
probably does not decrease all at once. Flying probably primes some of the
underlying subsystems, and as time passes some of the processing abilities
decrease. Thus, it is possible that different subsystems decline in different
ways and at different rates; indeed, pilots who had not flown recently showed an
atypical pattern of errors, as illustrated in Figure 3 (top right). Such an
explanation accounts for the interaction data presented in Figure 2, as well as
Figure 1, because it shows that the pilots who had not flown recently
accounted for much of the error pattern (high error in the intermediate
condition by fighter pilots, and high error in the easy condition by non-fighter
pilots). However, as reflected in Figure 3 (top right), this does not completely
account for the data. There might also have been an effect of age; the fighter
pilots were older than the non-fighter pilots (8.7 years older on average).
Even though this is not a large difference, the fighter pilots' mean age was in
the 30s and the non-fighter pilots' was in the 20s. Visual perceptual skills are
at their peak in the 20s and decline in the 30s. Thus, it is possible that the
performance of the fighter pilots was clouded in the intermediate
condition by decreased perceptual abilities. In the most difficult condition,
when 6 balls were used, their superior tracking ability outweighed their
decreased perceptual skills and they performed better. Still another possible
explanation for this pattern of results is that the groups used different
strategies to perform the tasks. If both groups of pilots used the same strategies
under the different conditions, then either both groups would perform the
same or one group would perform better. Our results show a dissociation of
performance across the different conditions. Thus, if different pilots use
different strategies to track objects, then such results are possible. Our present
research does not allow us to determine what caused this pattern of results.
Interestingly, this pattern occurred in both tasks.

GENERAL  DISCUSSION 

We set out to investigate some system components involved in spatial
awareness. We found that Air Force training and experience affect both
motion tracking and motion extrapolation abilities. Fighter pilots did perform
better in both tasks in the most difficult conditions. However, this superior
performance was not observed across all conditions. Although both tasks
produced the same pattern of results, the ability to track motion was
enhanced by a "recency flying effect", yet motion extrapolation was not.

Our research explores the abilities to track and extrapolate motion,
which seem to be important skills involved in spatial awareness. These skills
are especially critical in the domain of piloting, in which motion is a key
factor. Spatial awareness is a complex ability and probably involves many
additional components. One such component concerns the transformation of
the viewer-centered retinotopic visual input into a spatiotopic
object-centered representation (for more detail see Kosslyn et al., 1990; Kosslyn &
Koenig, 1992). Another component that is probably involved in the spatial
awareness of pilots is the ability to integrate different sources of knowledge.
This includes cross-modality inputs: pilots receive auditory information
regarding the spatial positioning of elements in their surroundings via
communication as well as directly from their environment. Pilots also need
to integrate the spatial information they obtain via the aircraft instruments as
well as maps and aerial photographs.

In  order  to  correctly  measure  spatial  awareness  ability,  one  needs  to  first 
understand  which  component  subsystems  are  involved  in  this  process,  and 
how they interact. Then, after having such a model, one needs to explore
which factors affect each component and use these factors as a tool for
quantifying the efficiency of the component subsystem. In this study we take a
first step in this direction: we explore two such subsystems and speculate on
two additional subsystems that may be involved in spatial awareness.

FUTURE ANALYSIS

Additional analysis of the data from this study may provide more
insight about motion tracking and extrapolation, and how Air Force training
and experience might affect these abilities. Four additional analyses are
recommended:

1. To further examine the claim that the two tasks do indeed tap into
different components, we need to perform a correlation analysis between
the two tasks. It would further establish that motion tracking and motion
extrapolation are indeed distinct processes.

2. In the present initial report the mean response time and error rate for each
subject were calculated for each cell of trials that had the same conditions
(number of balls and duration). We can substitute mean scores of cells by
slopes of the increase in time and error as a function of difficulty. The response of
the subjects in each trial can be evaluated by considering how probable it was
to make a mistake, and thus how difficult it was to give the correct response.
The closer the probe ball was to other balls, the more difficult the trial was.
Specifically, the "off" trials would be more difficult when there was a
small disparity between the location of the probe and an actual position of a
ball than when there was a great disparity; and similarly, the "on"
trials would be more difficult if a distractor ball (one that did not flash earlier)
was near the probe ball than if it was far away from it. Thus, instead of
using mean response time and error rate per cell, we can calculate a
regression slope that relates the errors and response times to the
"likelihood of making a mistake".

3. Additional analysis can further explore the abilities of the fighter pilots
themselves. The group of 20 fighter pilots who were tested on the tasks was
composed of 14 pilots who flew F-16 and F-15E and were thus primarily
trained on air-to-ground missions, and 6 pilots who flew F-15 and were thus
primarily trained on air-to-air combat. It would be interesting to explore if
these two types of pilots differed in skills of motion tracking or extrapolation.

4. Test non-pilot controls and compare their performance to that of the pilots (to the
overall performance of all the pilots and to the specific subgroups of pilots).
These data can be compared to the data obtained in the previous study where
pilots and non-pilots were tested on a variety of visual-spatial tasks (Dror,
Kosslyn, & Waag, 1992).
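
The first two recommended analyses rest on two standard computations: a correlation between subjects' scores on the two tasks, and a regression slope of performance on trial difficulty. The sketch below shows both in minimal form. The function names and the example numbers are purely illustrative placeholders, not data from this study, and the choice of Pearson correlation and ordinary least squares is an assumption about how the analyses might be carried out.

```python
from statistics import mean


def pearson_r(xs, ys):
    """Pearson correlation between paired scores (e.g., per-subject error
    rates on the tracking and extrapolation tasks)."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def ols_slope(difficulty, outcome):
    """Least-squares slope of an outcome (response time or error) on a
    per-trial difficulty score."""
    mx, my = mean(difficulty), mean(outcome)
    sxy = sum((x - mx) * (y - my) for x, y in zip(difficulty, outcome))
    sxx = sum((x - mx) ** 2 for x in difficulty)
    return sxy / sxx


# Placeholder values for illustration only.
tracking_errors = [18.0, 22.0, 25.0, 30.0]
extrapolation_errors = [20.0, 21.0, 29.0, 27.0]
print(pearson_r(tracking_errors, extrapolation_errors))
print(ols_slope([0.1, 0.3, 0.5, 0.7], [1100.0, 1180.0, 1290.0, 1350.0]))
```

A low correlation between the two tasks would be consistent with the claim that they tap distinct processes, and a per-subject slope would replace the cell means as the dependent measure in the second analysis.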
CAPTIONS


Figure  1:  The  results  from  the  motion  extrapolation  task. 

Figure  2:  The  interaction  between  pilot  grouping  (fighter  vs.  non-fighter)  and 
the  number  of  balls  in  the  motion  tracking  task. 

Figure 3: The four-way interaction in the motion tracking task. Top
panels: error rate as a function of experience, number of balls, and time
delay (left panel: pilots who flew recently; right panel: pilots who did
not fly recently). Bottom panels: error rate as a function of experience,
recency of flying, and time delay (left panel: easy condition; center
panel: intermediate condition; right panel: difficult condition).

Abstract

A  preliminary  investigation  was  conducted  to  observe  the  effects  of  a  dynamic 
model  during  acquisition  of  a  console  operations  task.  The  console  operations  task 
studied  was  a  simulation  of  a  procedure  involving  the  operation  of  a  remote  crane  control 
arm.  One  group  of  subjects  had  access  to  a  dynamic  graphical  model  during  practice  of 
the  procedure.  The  dynamic  model  appeared  as  a  meaningful  display  that  responded  to 
console operations. While one group of subjects received the dynamic model during
acquisition, all subjects were tested on their knowledge of the procedure without access
to the model. Results indicated that during acquisition, time-on-task and errors were
roughly equivalent for the model and no-model groups. During testing, however, the
model group was able to perform the procedure faster and more accurately. Future studies
on the effects of a dynamic graphical model during procedural acquisition are being
planned.

A PRELIMINARY INVESTIGATION OF THE EFFECTS OF A DYNAMIC GRAPHICAL
MODEL DURING PRACTICE OF A CONSOLE OPERATION SKILL


John  D.  Farquhar 


INTRODUCTION 

Notable theories of cognition and learning have asserted a fundamental distinction
between declarative and procedural knowledge types (Anderson, 1983; Kyllonen & Shute,
1989). Using device operation as an example, declarative knowledge is the fact-like
(what-it-is, what-it-does, and how-it-works) knowledge of a device. Procedural knowledge
is the rule-like (how-its-done) knowledge of device operation. Other forms of knowledge, such as
qualitative process or mental models, describe our understanding of the dynamic causal
nature of various devices (Anderson, 1988). Knowledge of a skill, such as device
operation, is arguably constructed of various knowledge types or forms of knowledge.

Procedural knowledge is clearly essential for device operation, but which other kinds
of knowledge are useful in this skill is less clear. For example, Kieras & Bovair (1984)
found that a description of how a device worked, or a device model, affected the
acquisition of device operation skills by increasing the speed of acquisition, accuracy of
retention, and speed of performance. Reder, Charney, and Morgan (1986) found that
certain types of instructional elaborations facilitated the acquisition of procedural computer
skills while others did not. Syntactic elaborations (i.e., how-its-done knowledge) improved
learning outcomes, whereas conceptual elaborations (i.e., what-it-is knowledge) produced
little effect.

This  study  was  simply  concerned  with  the  effects  of  providing  a  meaningful  model 
of  device  operation  within  the  context  of  interactive  practice  sessions.  Little  guidance  was 
given on how to interpret the dynamic model, and no instruction directed the subjects to
actively use the model in forming meaning from the device procedure. These
deficiencies, however, did not prevent subjects from using the model advantageously.

METHOD

Design: Independent variables for the experiment included the availability of a
dynamic graphical model (present or absent from the practice trials), six practice trials,
and three test trials. Subjects were randomly assigned to one of two treatments. Each
treatment included three phases: an instruction phase, a practice phase, and a test
phase. Dependent variables were completion time and number of errors for both sets of
practice and test trials.

Subjects: Twenty-nine subjects supplied through a temporary employment agency
were paid for their participation in the study. Selection was limited to high-school
graduates between the ages of 18 and 27 years. One subject was excluded for failure
to complete the final test trial. Subjects completing the experiment early were given
additional assignments.

Materials: A console operations task was designed to loosely represent a
real-world task in remote crane control arm operation. The task involves the operation of a
console to lift, transport, and load canisters from storage bins into rail cars. While skilled
console operators follow an interacting set of prescribed procedures, a single 51-step
procedure was selected for this experiment.

To provide instruction, practice, and testing for the task, a computer simulation was
developed. The simulation displays a complete control panel consisting of a set of 29
interrelated buttons, knobs, and switches. The controls respond to clicks of a mouse with
numerous switch and knob settings, meter readings, indicator lights, and an occasional
beep. All panel components are appropriately labeled and organized by function on the
lower half of the computer screen.

The upper half of the computer screen is reserved for on-screen messages,
additional user options (i.e., "Assistance"), and the dynamic graphical display. On-screen
messages include the practice or test trial number, a description of the step to perform
(e.g., "Set Crane Control to 'Enable'"), and feedback messages for incorrect performance
(e.g., "Incorrect. Click as indicated by the arrow.").

Selecting the "Assistance" option reveals a large red arrow indicating the location
of the next button, switch, or knob position. An incorrect action in the procedure
automatically selects the assistance option and displays the red arrow.
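
The assistance and feedback behavior described above can be summarized as a small state machine. The sketch below is hypothetical and is not the ToolBook implementation: the class and step names are invented for illustration, and it assumes that after an incorrect action the subject must still perform the same step and that a correct action hides the arrow.

```python
from dataclasses import dataclass


@dataclass
class ProcedureTrial:
    """Tracks progress through an ordered list of console actions."""
    steps: list                  # ordered control actions (hypothetical labels)
    position: int = 0            # index of the next step to perform
    errors: int = 0              # number of incorrect actions so far
    arrow_visible: bool = False  # whether the red assistance arrow is shown

    @property
    def done(self):
        return self.position == len(self.steps)

    def request_assistance(self):
        """Subject selects the on-screen "Assistance" option."""
        self.arrow_visible = True

    def act(self, action):
        """Register a control action; return True if it was the correct step."""
        if self.done:
            raise ValueError("procedure already completed")
        if action == self.steps[self.position]:
            self.position += 1
            self.arrow_visible = False   # ASSUMPTION: arrow clears on success
            return True
        self.errors += 1
        self.arrow_visible = True        # an error automatically shows the arrow
        return False


trial = ProcedureTrial(["crane_control_enable", "select_bin", "lower_arm"])
trial.act("select_bin")             # wrong step: counted as an error, arrow shown
trial.act("crane_control_enable")   # correct step: progress advances
```

The error count and the elapsed time per trial correspond to the two dependent variables named in the Design paragraph.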

The  dynamic  graphical  display,  centered  at  the  top  of  the  screen,  depicts  a 
representation  of  the  loading  task  which  dynamically  responds  to  control  panel  actions. 
The  display  provides  a  view  of  the  task  as  if  looking  down  onto  the  crane/loading  area. 
Low-fidelity  animated  sequences  demonstrate  realistic  responses  of  crane-console 
interactions. 

The  simulation  software  was  developed  using  the  ToolBook  programming 
environment  and  was  delivered  via  a  laboratory  of  22  individual  computer  stations.  The 
stations  were  equipped  with  Compaq  486/33L  microprocessors,  VGA-quality  monitors, 
keyboards,  and  right-handed,  three-button  mice. 

Procedure:  Subjects  were  brought  into  the  laboratory  in  groups  of  15  individuals. 
One  of  two  treatments  was  randomly  selected  for  the  group.  After  a  brief  orientation  in 
use  of  the  equipment,  the  subjects  were  instructed  to  proceed  through  the  program  at 
their  own  pace.  The  program  included  three  phases:  an  instructional  phase,  a  practice 
phase,  and  a  testing  phase. 

The instructional phase provided an introduction to the task of operating the crane
control arm. This brief tutorial introduced and explained the task of crane control
operations through a dynamic graphical model or display. The graphical model illustrated
how the actions of the crane might appear if the situation were viewed through a
window. The instruction did not provide any specific instructions in crane operation, nor
did it show or mention the related control panel. Finally, this phase concluded with
directions on how the user should interact with the system during the practice and test
phases.

During  the  practice  phase,  subjects  completed  6  trials  of  the  crane  control 
procedure.  A  text  prompt  appeared  on  the  screen  to  direct  the  subject  in  performing  the 
individual steps. An on-screen "Assistance" option was available that, when selected,
indicated the location of the next button or switch with a large red arrow.

During  practice,  one  treatment  group  had  available  an  on-screen  display  identical 
to  the  graphical  model  presented  in  the  instruction  phase.  This  model  represented  crane 
arm-control  panel  interactions  through  dynamic  changes  of  the  display.  The  dynamic 
model was not present for the control group.

Following  six  practice  trials,  subjects  entered  the  test  phase.  In  this  phase, 
subjects  made  three  attempts  at  performing  the  crane  control  procedure  without  the  aid 
of  text  prompts  or  assistance.  Neither  treatment  group  had  available  the  dynamic 
graphical  model  during  testing  trials.  If  an  error  was  made  during  testing,  the  red 
assistance  arrow  would  direct  the  subject  to  the  next  step,  forcing  ultimate  correct 
performance. 

RESULTS

Mean completion times and errors for nine trials (six practice, three test) are shown
in Figures 1 and 2, respectively. Nearly equivalent completion times and errors were
recorded for each of the six practice trials. Test trials yielded significant effects between
groups for both completion times [F(1,26) = 12.96, p < .001] and errors [F(1,26) = 15.59,
p < .001].


These  results  indicate  that  while  the  presence  of  a  dynamic  model  during  practice 
of  this  procedural  skill  did  not  alter  time-on-task,  the  model  did  aid  skill  performance  in 
testing  of  the  procedure  without  the  model.  Further  investigations  into  how  dynamic 
models  may  facilitate  the  acquisition,  retention,  and  transfer  of  procedural  skills  may  yield 
significant  recommendations  for  the  training  of  device  operation  skills. 


Figure 1: Completion Time by Trial

Figure 2: Errors by Trial


ABSTRACT

It is known that occupants of vehicle rollover accidents experience
complicated and often violent motion. In order to heighten standards for
occupant safety during rollover crashes, additional research of occupant
behavior in rollover crashes is necessary. Research using actual vehicles
would be extremely expensive and time consuming. Alternatively, research on
rollover accidents is being done with a human body gross motion simulation
program, called the Articulated Total Body (ATB) model. In order to produce
reliable data, this model must be validated. Validation is accomplished by
comparing actual test results to a computer simulation. In particular, the
verification of the ATB model was attempted by comparing the motion of the
simulated vehicle and occupant to that recorded on high speed film during an
actual 30 mph crash, in which the vehicle rolled over one time and the
occupant was a Hybrid III dummy. The vehicle simulation had an acceptable
comparison to the test, but the occupant simulation did not compare as well.
The difficulty in obtaining a reasonable occupant simulation was due to the
unrealistic initial conditions of the test, not to the capabilities of the ATB
model. Further work in simulating other rollover tests is necessary to fully
validate the predicted motion of an occupant that is produced by the ATB
model.

VALIDATION OF THE ARTICULATED TOTAL BODY MODEL
REGARDING ROLLOVER CRASHES

Jennifer M. Ferst


Occupant injuries that occur in rollover crashes tend to be very severe
and  damaging.  There  were  over  9600  persons  killed  in  rollover  accidents  of 
passenger  cars,  pickup  trucks,  passenger  vans,  and  utility  vehicles  in  1989. 

To improve the survivability of an occupant and the crashworthiness of
vehicles in rollover crashes, research is necessary. One form of research is
to use a computer model to simulate this type of crash. The model that has
been used and validated to predict frontal crash results is the Articulated
Total Body (ATB) model. This model has not been validated in
rollover crash situations. Rollover crashes involve three dimensions, whereas
frontal crashes involve primarily two. There are many advantages in
validating the ATB model. One, if the model can accurately predict the
behavior of an occupant involved in a rollover crash, initial conditions of
the vehicle motion can be inserted into the model and results can be sought
without the physical testing of vehicles. This would save money and time.

Two, presently there are no vehicle standards for the automobile industry
regarding the protection of occupants involved in rollover crashes. If the
ATB model is validated in predicting occupant behavior, parametric studies
could be performed to aid in the development of standards for the automobile
industry. A parametric study can include variations such as the study of the
effects of roof crush or the study of how different safety devices in a
vehicle could prevent injuries. Assuming this were possible, the general
safety and welfare of everyone who is an occupant in an automobile could be
improved.

DISCUSSION OF PROBLEM

Since the motion of a vehicle and its occupant is extremely violent and
complicated in a rollover crash, the Vulnerability Assessment Branch of
Armstrong Laboratory has been given the task of validating the ATB model with
respect to this type of automobile accident. This work has been commissioned
by the National Highway Traffic Safety Administration (NHTSA). The goal is
for the ATB model to be able to predict the dynamics of occupant motion in a
rollover crash. The model is being validated by matching the occupant
kinematics calculated by computer simulations to tests that have been
performed by the Transportation Research Center (TRC) of Ohio. These tests
have been conducted with different vehicles including Dodge Caravans and
Nissan pickup trucks. A 1989 Nissan pickup truck was evaluated during the
summer research program.

METHODOLOGY

For several years, the ATB model has been used at Armstrong Laboratory
for simulating and predicting gross human body dynamics. The model has aided
in understanding what might happen in different situations such as aircraft
ejection, frontal impacts, automobile panic braking, and sustained
accelerations. Recently, the ATB model has been used in the research
of automobile rollover crashes. Because of its proven predictive
capabilities, the ATB model has been identified and tested as a potential tool
to predict the motion of occupants in rollover crashes. However, the model
requires validation before its results can be relied upon.

The model is based on rigid body dynamics. This allows each part
of the body to be represented by a rigid segment. The segments are
connected in a tree structure by joints. The Hybrid III dummy is
represented by a total of seventeen segments coupled by sixteen joints.
Each segment has mass, moments of inertia, and a contact ellipsoid
associated with it. An example of the occupant, represented by
ellipsoids, can be seen in figure 1. He is sitting in the 1989 Nissan pickup
truck, which is stationary. The input data file of the ATB model requires
definition of the segments, the joint characteristics, the surrounding
environment, which is described with planes, the motion of the vehicle, the
contacts of the segments with either planes or other segments, the harness
belt definition and contact points, and the gravity forces. The output of the
model can include time history data of linear and angular accelerations,
velocities, displacements, segment-segment contacts, segment-plane contacts,
joint forces and torques, and other force or contact information. There is
also a visual output available. It graphically depicts the body position with
respect to its environment at certain time intervals. This graphical output
is available with a graphical display program developed by the Air Force
Institute of Technology for a Silicon Graphics computer.

Most of the standard measurements required for the ATB model are based
upon the inertial coordinate system. The origin of this coordinate system is
defined where test data begins to be collected. The positive x direction is
in the direction of motion. The positive y direction is to the right, and the
positive z direction is downward. Some information, such as the interior
planes of the vehicle, is based upon the vehicle coordinate system. In this
coordinate system, the origin is the center of gravity of the vehicle. The
positive x direction is toward the front of the vehicle. The positive y
direction is to the right side of the vehicle, and the positive z direction is
downward. Yaw, pitch, and roll angles are significant factors in the ATB
model. Yaw is the rotation about the z axis, pitch is the rotation about the
y axis, and roll is the rotation about the x axis.
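
As a rough illustration of how the three angles combine into a single orientation, the following sketch builds a direction cosine matrix from yaw, pitch, and roll. The rotation order (yaw about z, then pitch about y, then roll about x) and the function name `rotation_matrix` are assumptions made for illustration; the report does not state the order used inside the ATB model.

```python
import math

def rotation_matrix(yaw_deg, pitch_deg, roll_deg):
    """Return R = Rz(yaw) * Ry(pitch) * Rx(roll) as a 3x3 nested list.

    Assumed convention (not stated in the report): x forward, y right,
    z down. The columns of R are the vehicle axes expressed in inertial
    coordinates.
    """
    y, p, r = (math.radians(a) for a in (yaw_deg, pitch_deg, roll_deg))
    cy, sy = math.cos(y), math.sin(y)
    cp, sp = math.cos(p), math.sin(p)
    cr, sr = math.cos(r), math.sin(r)
    return [
        [cy * cp, cy * sp * sr - sy * cr, cy * sp * cr + sy * sr],
        [sy * cp, sy * sp * sr + cy * cr, sy * sp * cr - cy * sr],
        [-sp,     cp * sr,                cp * cr],
    ]

# At-rest angles quoted in the report: yaw -90.0, pitch 0.58, roll 29.2 degrees.
R = rotation_matrix(-90.0, 0.58, 29.2)
right_side_axis = [row[1] for row in R]  # vehicle +y axis in inertial coordinates
```

Under these assumptions the z-component of `right_side_axis` is positive, meaning the right side of the truck is tilted toward the ground (positive z is downward), which agrees with the test setup in which the right side makes first contact.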

In order to validate the ATB model, two simulations were performed
separately. These were the vehicle simulation and the occupant simulation. The
vehicle simulation was done first, so that the vehicle kinematics obtained
could be used in the occupant simulation. In this case, the vehicle portion
simulated the motion of a 1989 Nissan pickup truck, which was released from a
NHTSA rollover cart at 30 mph, rolled one full rotation, and came to rest 91.5
feet from its [illegible]. The occupant portion simulated the motion of a
three-point unibelt restrained and instrumented Part 572B dummy.

Initial conditions were a major and crucial aspect of the vehicle
simulation input file. Other information, including the geometry of the
vehicle and force deflection characteristics for each possible contact that
the vehicle might have, was also significant input. This information was
found in the test report and the data accumulated from the test. The initial
conditions that were needed include yaw, pitch, and roll angles, x, y, and z
displacements, linear velocities, and x, y, and z angular velocities. Once
these initial conditions were calculated, they were entered into the ATB
model. The ATB model then simulated four seconds of motion. At this point,
the actual test results and simulation results were compared.

For the test, the vehicle was placed on a NHTSA rollover cart situated
such that the vehicle's right side would have initial contact with the ground.
The vehicle and cart were towed as a unit until they reached a constant speed
of 30 mph. The cart and vehicle rolled over a switch which caused hydraulic
cylinders to actuate. These cylinders pushed the truck from the cart, and the
cart decelerated. The vehicle was instrumented with a three-axis
accelerometer and a three-axis angular rate gyro. The rollover cart was
instrumented with a three-axis accelerometer. The test was filmed by six
high-speed motion picture cameras that recorded at a speed of approximately
500 frames per second. One real time panning camera was also used. The
initial roll and pitch at rest were calculated from measurements of the truck
on the cart. It was found that the initial roll angle was 29.2 degrees and
the initial pitch was 0.58 degrees. An assumption was made that the initial
yaw was -90.0 degrees.

In the ATB model, it would be difficult to model the truck
while it was still on the cart. So, the vehicle simulation began once the
vehicle was completely separated from the cart. The time at which the truck
was completely freed from the cart was referred to as the vehicle
simulation start time. This time was determined by examining two plots in the
test report labelled "vehicle/roll cart separation time - upper switch"
and "vehicle/roll cart separation time - lower switch." These
graphs are illustrated in figures two and three. The switches that the
plots referred to were those that were mounted to each side of the truck and
rollover cart. When the vehicle broke free from the cart, the
switches were broken, which in turn sent a signal for each switch. From
the plots, it appeared that the vehicle was fully separated from the cart at
0.710 seconds. From further investigation, mainly viewing the film, it was
determined that the truck was only partially free from the cart at 0.710
seconds. The vehicle was actually completely separated from the cart at 1.0
second. This time was chosen to be time zero for the vehicle simulation.

Since the test start time and simulation start time were different, the
roll, pitch, and yaw angles had to be calculated for the vehicle start time.
This was done by integrating the angular velocity test data in all three
dimensions from 0.0 to 1.0 second. The integration was calculated using
SigmaPlot. The integrated number at 1.0 second was the angular position at
that time, which was the beginning of the vehicle simulation. Therefore, the
roll actually was 87.4 degrees, the pitch was 1.61 degrees, and the yaw was
-92.9 degrees.
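
The integration step can be sketched as follows. The report used SigmaPlot and does not say which numerical rule was applied, so the trapezoidal rule here is an assumption, as are the function name `integrate_rate` and the sample values, which are invented for illustration and are not test data. Whether the at-rest angle was added as an offset is also not stated, so it is left as an optional argument.

```python
def integrate_rate(times, rates, initial_angle=0.0):
    """Cumulative trapezoidal integral of an angular rate (deg/s).

    Returns the angle (deg) at each sample time, starting from initial_angle.
    """
    angles = [initial_angle]
    for k in range(1, len(times)):
        dt = times[k] - times[k - 1]
        angles.append(angles[-1] + 0.5 * (rates[k] + rates[k - 1]) * dt)
    return angles

# Hypothetical roll-rate samples (deg/s) over the first second of the test.
times = [0.0, 0.25, 0.5, 0.75, 1.0]
roll_rate = [0.0, 0.0, 20.0, 80.0, 140.0]

# Angle history; the last entry is the roll angle at the vehicle
# simulation start time for these made-up samples.
roll = integrate_rate(times, roll_rate, initial_angle=29.2)
```

The same function would be applied separately to the pitch-rate and yaw-rate channels.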

The linear displacements were calculated differently for each dimension.
In the z direction, the displacement was the vertical distance between
the ground and the vehicle's center of gravity at one second. This height was
obtained using the vehicle's and cart's measurements, photographs of the
truck on the cart, and the roll angle, which had been calculated previously. This
distance was found to be -63.12 inches. The linear displacement in the y
direction was assumed to be negligible since the yaw, as estimated from
photographs, was zero. The x linear displacement was the distance the truck
travelled from the beginning of the test to 1.0 second. The cart acceleration
test data was used to calculate this. The cart data could be used in this
dimension because the vehicle and cart travelled at the same rate for one
second. The calculation involved several integrations, conversions, and an
assumption that the cart reached a constant speed of 30 mph. The vehicle
travelled 428.7 inches in one second. These displacements described the
position of the pickup at the vehicle simulation start time.

The linear velocity was calculated in the x direction only. In the y
and z dimensions, the linear velocity was assumed to be zero. These
assumptions were based on the fact that the roll angle of the vehicle
was approximately 90 degrees. Therefore, the tangential velocity
had a component only in the x direction. The linear velocity in
the x direction was calculated using the cart's acceleration data. The
data was integrated from 0 to 1.0 second, and the rate of 528 inches
per second (30 mph) was added to the integrated accelerations. So, the
linear velocity in the x dimension was 429.01 inches/second. This was
the rate at which the truck was travelling in the x direction at the
vehicle simulation start time.
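
The arithmetic in this paragraph can be checked with a short sketch. The unit conversion is exact. The velocity change is inferred here from the two figures quoted in the text (528 and 429.01 inches/second) rather than read from the cart accelerometer data, and the function names are illustrative.

```python
INCHES_PER_MILE = 5280 * 12
SECONDS_PER_HOUR = 3600

def mph_to_inches_per_second(mph):
    """Convert miles per hour to inches per second."""
    return mph * INCHES_PER_MILE / SECONDS_PER_HOUR

def velocity_at_release(tow_speed_mph, integrated_accel):
    """Tow speed plus the integral of the cart acceleration (in/s)."""
    return mph_to_inches_per_second(tow_speed_mph) + integrated_accel

tow_speed = mph_to_inches_per_second(30)  # 528.0 in/s, as stated in the text

# Net velocity change over the first second implied by the quoted figures.
# This is an inference for illustration, not a measured value.
implied_delta_v = 429.01 - tow_speed

x_velocity = velocity_at_release(30, implied_delta_v)
```

The implied change is negative, which is what one would expect if the cart was losing speed during the first second.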

The angular velocity was calculated in the x, y, and z directions; the
corresponding plots are figures 4, 5, and 6. The values were determined from
the test's angular velocity data. In the test report, angular velocity was
plotted versus time in all three dimensions.
From studying these graphs, the vehicle appeared to have a constant angular
velocity between 1.0 and 1.25 seconds. The data was averaged within this time
interval. In the x direction, the average angular velocity was 145.65
degrees/second. In the y direction, the average was 7.93 degrees/second, and
the average was -7.21 degrees/second in the z direction. These figures
prescribed the rolling motion at the vehicle simulation start time.
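
A minimal sketch of the averaging step is shown below. It assumes evenly weighted samples and an inclusive time window; the report does not describe how the average was computed, and the sample values are hypothetical rather than taken from the test.

```python
def mean_in_window(times, values, t_start, t_end):
    """Average the samples whose time stamps fall in [t_start, t_end]."""
    selected = [v for t, v in zip(times, values) if t_start <= t <= t_end]
    if not selected:
        raise ValueError("no samples fall inside the requested window")
    return sum(selected) / len(selected)

# Hypothetical roll-rate samples (deg/s) around the simulation start time.
times = [0.9, 1.0, 1.1, 1.2, 1.3]
roll_rate = [120.0, 140.0, 150.0, 145.0, 160.0]

# Average over the 1.0 to 1.25 second interval named in the text.
initial_roll_rate = mean_in_window(times, roll_rate, 1.0, 1.25)
```

Averaging over a short interval where the rate looks constant reduces the influence of sample-to-sample noise on the initial condition.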

Once the vehicle simulation was acceptable, the occupant simulation
input file could be established. First, timing information had to be
acquired. From observing the film, it was evident that the occupant began to
move before the vehicle simulation start time. The occupant simulation should
begin when the occupant is at rest. Generally, the occupant was at rest
before the hydraulic cylinders on the cart began to push the vehicle. So, to
determine the exact time that the cylinders began to move, plots were
studied. These were in the test report labelled "roll cart left
cylinder displacement" and "roll cart right cylinder displacement."
These graphs are illustrated as figures 7 and 8. The time that the
cylinders began to move was 0.5 seconds. This time was chosen to be
the occupant simulation start time.

Additional data required for the occupant simulation were the time
histories of vehicle displacements in all directions and the vehicle yaw,
pitch, and roll angles. Most of these data came from the output of the
vehicle simulation, but some data had to be calculated because of the
difference between the occupant simulation start time and the vehicle
simulation start time. As stated earlier, the occupant simulation start time
was at 0.5 seconds of the test, and the vehicle simulation start time was at
1.0 second of the test. The vehicle displacements and angular orientations
had to be calculated between 0.5 and 1.0 second of the test in order to have
all of the data needed to run the occupant simulation. These calculations
were based on the vehicle test data.

In  order  to  eliminate  anomalies  in  the  data,  the  test  data  was  filtered 
at  22  Hz.  To  calculate  the  angular  orientations  and  vehicle  displacements, 
the  same  procedure  was  performed  as  before  when  calculating  the  yaw,  pitch, 
and  roll  angles,  and  x  and  z  linear  displacements  for  the  vehicle  simulation 
start  time.  After  the  calculations  had  been  completed,  points  were  chosen 
every  40  milliseconds  starting  at  0.5  seconds  of  the  test.  These  points  were 
chosen  up  to  1.0  second  of  the  test.  The  data  from  1.0  to  5.0  seconds  of  the 
test  was  obtained  from  the  time  history  output  files  produced  by  the  vehicle 
simulation. 
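
Because three time bases are in play (test time, vehicle simulation time, and occupant simulation time), a short sketch may help keep them apart. The two start times and the 40 millisecond spacing come from the text; the function names are illustrative, and the handling of the 1.0 second endpoint is an assumption, since a 40 millisecond grid starting at 0.5 seconds does not land exactly on 1.0 second.

```python
VEHICLE_SIM_START = 1.0    # seconds of test time (from the text)
OCCUPANT_SIM_START = 0.5   # seconds of test time (from the text)

def occupant_time_from_test(t_test):
    """Convert test time (s) to occupant simulation time (s)."""
    return t_test - OCCUPANT_SIM_START

def vehicle_time_from_test(t_test):
    """Convert test time (s) to vehicle simulation time (s)."""
    return t_test - VEHICLE_SIM_START

def sample_times_ms(start_ms, end_ms, step_ms):
    """Sample instants in integer milliseconds, never exceeding end_ms."""
    return list(range(start_ms, end_ms + 1, step_ms))

# Instants (in test time) at which the calculated vehicle data were sampled.
calculated_samples = sample_times_ms(500, 1000, 40)

# The vehicle simulation begins 0.5 s into the occupant simulation.
handover = occupant_time_from_test(VEHICLE_SIM_START)
```

Integer milliseconds are used for the grid so that repeated addition of 0.04 does not accumulate floating-point error.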

RESULTS

After the initial conditions were determined for the vehicle simulation,
they were inserted into the ATB model input file, and the vehicle motion was
simulated. The simulation was compared with the film. The timing of the
initial hit was not correct. The initial roll angle did not compare well, and
the vehicle's linear displacement in the x direction was not large enough.
Adjustments were made to these initial conditions in order for the simulation
to be correct. The initial roll angle actually used in the vehicle
simulation, in order to match the film, was 95.4 degrees. This change in roll
angle altered the calculations of the linear displacements in the x and z
directions. The corrected value of the x linear displacement was 434.18
inches, and the z displacement was -59.4 inches. However, additional
adjustments were needed for the linear displacements. The vehicle needed to
travel further in the x direction in a shorter amount of time. To achieve a
larger x linear displacement, the linear velocity had to be increased.
Initially, the assumption was made that the cart reached a constant velocity
of 30 mph. This assumption was incorrect. From observing the film, it
appeared to be more correct to assume that the cart reached a constant
velocity of 31 mph. Therefore, the linear velocity in the x direction was
changed to 447.0 inches/second. This new assumption altered the x
displacement to 493.7 inches. The z linear displacement had to be adjusted to
-49.12 inches. This change allowed the vehicle to fall to the ground in a
shorter amount of time. The need for this adjustment could be due to the many
assumptions that were made while originally calculating the z linear
displacement, such as scaling cart dimensions from a photograph. After all
changes were made, the vehicle simulation was acceptable and matched the
vehicle motion on the film fairly well.

After the necessary calculations, all of the information needed to run
the occupant simulation was entered into the occupant simulation input file.
The simulation was run, and the simulated occupant initially did not behave
similarly to the test's occupant. The origin of the problem appeared to
be in the x and z displacement data from the occupant simulation start time to
0.5 seconds of the simulation. This was the data that was calculated, not
taken from the vehicle simulation output data. To visualize the data, the x and z
displacements were graphed versus time. In order to eliminate data points
that might cause problems in the simulations, curve fits were performed for
the graphs of the x and z displacements between the times of 0.0 and 0.7
seconds of the simulation. The curve fits generated new data points. These
were entered into the input data file, and the simulation was run again. The
occupant's motion followed the film for the first 400 milliseconds, but then
travelled in the opposite direction from what he should have. Every known
source of error was accounted for, and the simulated occupant still would not
follow the motion of the film's occupant. He moved too far across the seat
toward the passenger's side. The simulation was considered complete.

CONCLUSION

The problems that occurred in the occupant simulation seemed to
originate in the calculated vehicle displacement data. This is thought to
happen because when the occupant simulation began, the vehicle was still on
the rollover cart. Since the cart was not modelled, the data
calculated to account for that motion was an educated guess. If the vehicle
simulation start time and occupant simulation start time were identical, these
problems theoretically would not have occurred. Therefore, the ATB model
still has the potential to predict the motion of an occupant who is involved
in a rollover crash, but further work is required. This work would involve
being able to model the vehicle motion while on the cart so the occupant
simulation start time and vehicle simulation start time were the same. More
tests of the Nissan pickup truck will be modelled to clearly identify if this
is a valid problem.

Transmission Delays and Bursting in Statistical
Mechanical Neural Network Models

Michelle A. Fitzurka
Doctoral Candidate

Abstract

The ultimate aim of this study is to further our current understanding of the
electroencephalogram (EEG), or electrical recording of the activity of the human brain, in
its resting state so as to better comprehend the effect of external fields applied to it. To
this end, we invoke a neural network model that employs the language and formalism of
statistical mechanics in order to explain the behavior of neurons in the brain on average,
speaking of the probability of a neuron firing, rather than predicting the individual firing
pattern of a specific neuron at a specific time. While there is much to be gained by this
novel and rich approach, caution must be exercised in the mathematical development and
application of any such model to biological systems. We intend to investigate the current
statistical mechanical neural network models and suggest ways in which to impose more
stringent biological accuracy on models of brain functioning while retaining mathematical
tractability. Eventually, we see a need to summon the techniques of non-equilibrium
statistical mechanics, a promising direction which to date has not been wholly explored.




Table  of  Contents 


Introduction: 

Overview of the Literature and Impetus for Further Investigation

1. Equilibrium Models:

1.1 Overall Justification of a Statistical Mechanical Approach

1.2 Background of Cowan’s Work on Neuronal Models

1.3 Cowan’s Statistical Mechanical Neuronal Model

2. Non-Equilibrium Models:

2.1 Our Corrections to Cowan’s SMN Model

2.2 Future Directions

2.3 Applications to Health and Safety

Conclusion

References

Endnotes




Transmission Delays and Bursting in Statistical
Mechanical  Neural  Network  Models 

M.  A.  Fitzurka 


Introduction: 

Overview  of  the  Literature  and  Impetus  for  Further  Investigation 

The history of the mathematical modeling of complex dynamical systems is both long and vast, as
is the legend of the modeling of biological systems. The record of the mathematical modeling of
biological systems as complex dynamical systems, however, is not as rich and has been limited to
those resolute mathematicians who have attempted to remain faithful to biological realism. While
there exists a wealth, a veritable explosion, of information on the utility of biologically-motivated
mathematical models, strict adherence to biological integrity has been neither widespread nor
rigorous. This failing has been well noted by Dr. Judith Dayhoff,* in her textbook Neural
Network Architectures: An Introduction, wherein she provides a detailed delineation of the
various differences between current artificial neural network (ANN) models and true biological
neural networks (BNN). See Table 1 in Section 2.2. There is a striking urgency for further rigor
in addressing the precise restrictions of biology.

Historically, adherence to biological realism has been avoided due to the greatly
complicated, and sometimes intractable, mathematics that is involved as a result of levying strict
biological requirements. Also, in many instances, extraordinary practical applications have
overshadowed the necessity for imposing biological accuracy. The broad range of successful, though
biologically imprecise, models has therefore forced the demands of biology to be selectively
ignored or postponed. Consequently, for reasons of mathematical complexity and practical efficacy,
biologically-correct neural network models have not been fully developed though the need for
them is profound. It is our hope that this trend be reversed.


*Dr. Dayhoff has a PhD in biophysics from the University of Pennsylvania and works at the Naval Surface
Warfare Center in Md. She is on the board of the International Neural Network Society (INNS) and has expressed
interest in my work, as has the President of INNS, Dr. Harold Szu. I am grateful that I will be able to rely on
their knowledge and experience for insight into my research.

Of the many and interesting problems that could conceivably be enlightened by more
stringent studies of biologically-precise mathematical models, those questions dealing with the
health and safety of human beings would inarguably be of highest priority. Taking precedence,
therefore, would be an investigation for a precise model of the resting (EEG) state of the human
brain, which, as of yet, has eluded conclusive explanation. This most basic yet fundamentally
crucial element in the understanding of human functioning remains unclear despite our current
and extensive knowledge of man, so its decoding would be of great benefit. This defines our
motivation.

We proceed by expounding a model of J. D. Cowan. In 1970, Cowan resurrected and
remarkably refined the mathematics of a model of the statistical dynamics of population kinetics
introduced initially by E. H. Kerner in 1957. His model has enormous potential and represents
the starting ground for our analysis. A detailed description of this model is in order at this
point. Afterwards, there will be a discussion of our intention to improve this scheme, followed
by an indication of what we foresee as the future possibilities and advantages of pursuing this
perspective, most especially with regard to the safety and integrity of man.


1.  Equilibrium  Models: 

The model of Cowan belongs within a category of models which invoke equilibrium statistical
mechanics, and therefore we term this section equilibrium models. It will also be seen that, as
an extension to Cowan’s work, more advanced statistical mechanics will eventually be required.
Accordingly, there is a section to follow covering these more sophisticated topics.

1.1  -  Overall  Justification  of  a  Statistical  Mechanical  Approach 

In order to warrant the employment of a statistical mechanical approach in explaining the
functioning of the brain, it is necessary to view the brain’s average behavior as an aggregate of the
behavior of the individual neurons that comprise it. As there are roughly $10^{11}$ neurons [1], each
with $10^{4}$ axonal connections [2] to other neurons, there are a minimum of $10^{15}$
totally interconnected pathways within the brain, a number that approaches Avogadro’s number,
$N \sim 10^{23}$. Certainly, it is fitting and proper for us consequently to consider the “motion” (firing
rates) of each of these “particles” (neurons) on average, thus invoking the entire language and
formalism of statistical mechanics.

Many other important averages and quantities of ultimate interest may be determined by
taking advantage of the machinery of statistical mechanics. The average firing rate of neurons,
or activity in the brain, is considered to be the formal analogue of “energy,” and a subsequent
Hamiltonian mechanics proceeds. The “temperature,” corresponding to the fluctuation in this
activity, admits a Gibbs ensemble theory for these variations. Obtaining these and other desired
values requires the introduction of some of the vocabulary of neurophysiology, neural networks,
physics, and biophysics.


1.2  -  Background  of  Cowan’s  Work  on  Neuronal  Models 

The model developed by Cowan borrows heavily from physics and cultivates several of the
fundamental concepts and ideas used in physical modeling. The first of these pertinent analogies
enlists the terminology of the classical “lumped equivalent-circuit” of elementary circuit
theory in order to describe the functioning of the somato-dendritic neural membrane.
Understanding the mechanisms dictating the workings of this membrane is key in developing a
cellular or neuronal model, which justifies this as a valid starting point. Figure 1, taken directly from
Cowan’s article (1970), depicts the equivalent circuitry representing neural membrane
behavior. To first approximation, all of the relevant information on the linear depolarization of
the membrane can be represented by the following first-order differential equation


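$$\frac{dv}{dt} + \frac{v}{\tau} = \frac{1}{C_m}\left[\, i_m(t) + \frac{(E_e - E_m)}{R_e} + \frac{(E_i - E_m)}{R_i} \right] \tag{1}$$

where $\tau$ is the time constant of the membrane given by

$$\frac{1}{\tau} = \frac{1}{C_m}\left[ \frac{1}{R_m} + \frac{1}{R_e} + \frac{1}{R_i} \right] \tag{2}$$

reducing to $\tau = R_m C_m$ with open-circuited synapses. These equations and the associated
Figure 1 provide a reasonable basis for initiating a mathematical description of biological phenomena.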
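
As a minimal numerical sketch of this membrane equation, the Python below integrates it by forward Euler steps. It assumes EQN(1) has the standard lumped-circuit form dv/dt + v/tau = (1/C_m)[i_m + (E_e - E_m)/R_e + (E_i - E_m)/R_i], with tau given by EQN(2). The symbol E_e (an excitatory battery potential), the function names, the constant input current, and every component value are hypothetical illustrations rather than quantities taken from Cowan.

```python
import math


def membrane_time_constant(c_m, r_m, r_e=math.inf, r_i=math.inf):
    """Return tau from 1/tau = (1/C_m) * (1/R_m + 1/R_e + 1/R_i)."""
    total_conductance = 1.0 / r_m + 1.0 / r_e + 1.0 / r_i
    return c_m / total_conductance


def simulate_membrane(i_m, c_m, r_m, r_e, r_i, e_m, e_e, e_i, dt, steps):
    """Forward-Euler integration of dv/dt + v/tau = drive, starting from v = 0.

    v is the membrane potential measured relative to rest; i_m is a constant
    applied current (an assumption made to keep the sketch short).
    """
    tau = membrane_time_constant(c_m, r_m, r_e, r_i)
    drive = (i_m + (e_e - e_m) / r_e + (e_i - e_m) / r_i) / c_m
    v = 0.0
    trace = [v]
    for _ in range(steps):
        v += dt * (drive - v / tau)
        trace.append(v)
    return tau, trace


# Hypothetical values: 1 nF capacitance, 10 megohm membrane resistance,
# synapses open-circuited (infinite synaptic resistance), 0.1 nA input.
tau, trace = simulate_membrane(
    i_m=1e-10, c_m=1e-9, r_m=1e7, r_e=math.inf, r_i=math.inf,
    e_m=-0.07, e_e=0.0, e_i=-0.08, dt=1e-4, steps=2000,
)
```

With the synapses open-circuited the time constant reduces to R_m C_m, and the potential relaxes toward the steady value drive * tau, which is the behavior the first-order equation predicts.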

In this scheme, the neural membrane is a linear system with $i_m(t)$ as an applied
membrane current; $C_m$ is the membrane capacitance. $R_e$ and $R_i$ are the excitatory and inhibitory
synaptic resistances combined in parallel. The $g_e$ and $g_i$ terms represent excitatory and inhibitory
conductivities while $g_m$ is the membrane conductivity. $E_i$ and $E_m$ stand respectively for the
inhibitory membrane and resting membrane potentials, while $v(t)$ represents the membrane potential
measured relative to the resting potential, in this case $V_{\text{inside}} - V_{\text{outside}} - E_m$. Resting potentials
attributable to the cell or neuron’s membrane while it is quiescent are typically $-70$ mv, as indicated
in the figure. The IPSP and EPSP symbols represent the inhibitory and excitatory post-synaptic
potentials, labels given to the responses to the pre-synaptic excitation [3].

[Figure 1: circuit diagram not reproduced. Labels recoverable from the original: outside, inside, IPSP, EPSP, 80 mv.]

Figure 1: Circuit Model for the Somato-Dendritic Neural Membrane. (Source: Cowan, J.D. 1970, Some
Mathematical Questions in Biology in the Life Sciences, p. 48.)

The cell membrane, $\approx 75$ Å in thickness, is a highly ordered, bimolecular lipoprotein
layer which separates the fluids between and within the cells, the interstitial and intracellular fluids
respectively. The composition of these fluids is mostly water with comparable numbers of particles
per unit volume dissolved within: sodium, Na+, and chloride, Cl−, in the interstitial fluid and
potassium, K+, in the intracellular fluid. Variations in the ionic concentrations inside and outside the
cell contribute to the electric potential difference between these two fluids, which is relieved or
lessened only by the passive diffusion or active transport of these ions through channels or gates
in the membrane layer. This unidirectional flux (inward at an excitatory synapse; outward at
an inhibitory synapse) is opposite to the concentration gradient and continues until the net flux
is zero and ionic equilibrium is reached [4].

In its active state, the neuron is termed either depolarized, when the reduction in the
magnitude of the potential across the membrane toward 0 mv [5] has the effect of decreasing the
negativity of the interior of the cell with respect to the exterior, or hyperpolarized, when there
is an increase in the potential across the cell’s membrane, the inside becoming more negative,
approaching −80 mv [5]. Depolarization or hyperpolarization, therefore, refers to the net outward
or inward movement, or flux, of ions across the neuronal membrane at inhibitory and
excitatory synapses. These synapses, or junctions, are the connection points between the terminal
axon of the antecedent cell and the somato-dendritic membrane of the latter cell.

It  is  the  impulse,  or  action  potential  that  is  responsible  for  initially  depolarizing  the  cell 
until  it  reaches  a  characteristic  threshold  value,  at  which  time  its  reaction  is  to  initiate  an  impulse 
which  is  then  transferred  to  the  consecutive  neuron  via  the  synaptic  gap.  The  neuron  becomes 
hyperpolarized  and  a  refractory  period  of  time  occurs  before  it  returns  to  its  resting  state,  where 
it  is  able  once  again  to  fire.  It  is  the  specific  nature  of  this  impulsive  response  of  a  cell’s  membrane 
from  its  resting  state,  its  action  potential,  that  is  a  matter  of  fundamental  importance  and  marks 
our  point  of  departure  from  Cowan’s  work.  While  Cowan  allows  for  the  transmission  of  single 
impulses in the mechanics he derives, he does not treat refractory times, the all-or-nothing aspect
of  the  propagated  action  potential,  the  finite  speed  of  the  action  potential,  or  neuronal  bursting. 

For the action potential, Cowan assumes the Heaviside-Dirac impulse function, to which
the membrane responds as

$$h_{ij}(t) = u_{-1}(t)\,[\delta v_{ij}]\exp(-t/\tau), \tag{3}$$

$u_{-1}$ being the Heaviside step function and $\delta v_{ij}$ being the deviation of the membrane potential
due to the arrival at the $i$th cell of an impulse from the $j$th cell. He chooses this first-order
linear equation, which proceeds directly from EQN(1), since it describes well the all-or-nothing
character of neurons. This all-or-none behavior refers to the response of the neuron being fixed in size,
shape, duration, and conduction speed [6]. There is also included a decay term which accounts for
the drop-off in the effect of the impulse in time. The accumulated effect, or “built-up” potential
in the membrane, is consequently represented with the following convolution containing EQN(3)

$$v_i(t) = \sum_j \int_{-\infty}^{t} h_{ij}(t - \tau)\, f_j(\tau)\, d\tau \tag{4}$$

with the introduction of the term $f_j(\tau)$ as the mean frequency of the arrival of impulses from
the $j$th primary cell. This equation carries implicitly the linearity of the system, incorporating
the aggregate nature of the neuron by accounting for its linear response to multiple inputs.
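
The convolution of an exponentially decaying impulse response with the input firing rates can be sketched numerically as follows. This is a minimal illustration that assumes the response has the form h(t) = u(t) * delta_v * exp(-t / tau) and approximates the integral by a Riemann sum. The function names, the constant 100 impulses-per-second input, and the parameter values are hypothetical.

```python
import math


def impulse_response(t, delta_v, tau):
    """h(t) = u(t) * delta_v * exp(-t / tau); the step makes it zero for t < 0."""
    return delta_v * math.exp(-t / tau) if t >= 0.0 else 0.0


def membrane_potential(rates, delta_vs, tau, dt):
    """Riemann-sum estimate of the summed convolution over all inputs j.

    rates[j][k] is the mean firing rate of input cell j at time k * dt, and
    delta_vs[j] is the potential deviation produced by one impulse from cell j.
    Returns the built-up potential at each sample time.
    """
    n = len(rates[0])
    v = [0.0] * n
    for f_j, dv_j in zip(rates, delta_vs):
        for k in range(n):
            acc = 0.0
            for m in range(k + 1):
                acc += impulse_response((k - m) * dt, dv_j, tau) * f_j[m]
            v[k] += acc * dt
    return v


# Hypothetical example: one input firing steadily at 100 impulses per second.
dt, tau = 1e-4, 0.01
steady_rate = [100.0] * 1000
v = membrane_potential([steady_rate], [1e-3], tau, dt)
```

For a steady input the built-up potential levels off near delta_v * f * tau, and doubling or adding inputs scales the result proportionally, which is the linearity the convolution carries implicitly.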

Finally, with all the constants written explicitly and reduced by proper approximations,
Cowan derives an expression for the transmembrane potential resulting from a large number of
inputs

[Equation (5) is missing from the source.]

The effective operating period for the transmission of excitation from primary to secondary cell is
$\sigma$, the membrane conductance and corresponding change in membrane conductance are $g_m$ and
$\delta g_{ij}$, and the change in membrane potential caused by the action of the impulse from the $j$th cell
is $E_j$. This expression is the result of the reduction of a linear differential system, EQN(1), to a non-linear
function of time, EQN(5), and provides a straightforward means of calculating the response of a
membrane to its many inputs. With these terms defined accordingly, we are now in a position
to show how Cowan manipulates these quantities to develop a statistical mechanical formulation
for the activity of a large ensemble of neurons.

1.3  -  Cowan’s  Statistical  Mechanical  Neuronal  Model 

The basic equations of Cowan are extensions of the work begun by Kerner in 1957, as previously
indicated. The equations of Kerner in turn find their origin in the monumental treatise
of Volterra on population dynamics in 1931, most probably instigated by the suggestion of A.J.
Lotka, who in 1925 remarked that “... what is needed is an analysis...that shall envisage the units
of a biological population as the established statistical mechanics envisage molecules, atoms, and
electrons; that shall deal with such average effects as population density, population pressure
and the like after the manner in which thermodynamics deals with the average effects of gas
concentration, gas pressures, ...” [7].

While Volterra proposed a system of differential equations to describe the variation in
time  of  the  populations  Nr  of  interacting  species  in  a  biological  association,  it  was  Kerner  who 
noticed  first  that  this  system  admits  a  Liouville’s  theorem,  when  the  log  Nr  are  taken  to  be  the 
variables,  and  second  that  a  universal  “integral  of  motion”  ensues  [8].  Cowan's  contribution  to 
this  unique  development,  as  we  shall  see,  is  that  he  calculates  explicitly  the  statistical  mechanical 
averages  according  to  Kerner’s  prescription  and  shows  unambiguously  its  relation  to  the  specific 
problem  of  neural  activity. 

The focal point around which the dynamics of Cowan is centered is his neural equation
associating the input, $f_j(t)$, and output, $f_i(t)$, firing rates. It is from this equation that the neural
Hamiltonian and the subsequent statistics follow. In EQN(5), we already have the connection
between the output firing rates and the membrane potential. By recognizing the following basic
relationship

$$\delta v_{ij} = \delta q_{ij}/C_m = i\,\delta t/C_m, \tag{6}$$

we can convert the change in membrane potential to the corresponding current $i$
produced. Taking appropriate time averages in EQN(5) and solving for $i(t)$ in EQN(6) gives us
the current flowing in the cell membrane as a function of time

[Equations (7) and (8) are largely illegible in the source. The surviving fragment of EQN(7) expresses
$\langle i_i(t) \rangle$ as a prefactor multiplying $\langle \exp(-t/\sigma) \rangle$, and EQN(8) restates it after
$\langle \exp(-t/\sigma) \rangle$ has been reduced.]

Now, we have the output firing rates also related to the
current.

Next, Cowan chooses to employ a logistic function to represent the nonlinear input-output
relation of the cell. This function is a logical choice because of its inherently statistical nature and
has some support in the biological literature; however, there is no reason to see a priori the
distinct advantages that follow from its use. It is chosen in part because the logistic function is
one of the easier sigmoidal curves to work with [9]. Thus, according to Cowan, the mean rate at
which the $i$th cell emits impulses as a function of membrane current is

$$f(i) = \left[\, r \left( 1 + \exp\left[ -\beta \left( [\,\cdot\,] - 1 \right) \right] \right) \right]^{-1} \tag{9}$$

where $[\,\cdot\,]$ stands for a bracketed function of the membrane current that is illegible in the source.
This equation can be manipulated further by substituting EQN(8) into EQN(9) to yield

[Equation (10) is largely illegible in the source; the surviving fragment has the same form,
$[\, r\,(1 + \exp[\,\cdots])\,]^{-1}$.]

Here $\beta$ is simply

$$\beta = \frac{1}{\theta} \tag{11}$$

for reasons which become more clear when $\theta$ is understood to be the equivalent of $kT$, $k$ being
the standard Boltzmann’s constant of statistical mechanics, equal to $1.38 \times 10^{-23}$ J/K.

Finally, we have an expression relating the output firing rates directly to the input firing
rates. We can further rearrange this equation, by letting

$$x(t) = 1 - r f(t) \tag{12}$$

(the source continues this line with a second equality whose denominator is illegible), to reexpress EQN(9) as

[Equations (13) through (16) are illegible in the source.]

The $x$ variable is called the sensitivity or neural sensitivity. With the neural equation in this
form, Cowan makes one final assumption, and one final transformation, before extracting the
neural Hamiltonian, $G$.
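
A minimal sketch of the logistic rate relation and the sensitivity variable is given below. It assumes that the bracketed term in EQN(9) is the membrane current divided by a threshold current; the name `i_threshold` is hypothetical, since that term cannot be read here. It also assumes that $r$ acts as a refractory period, so that the rate saturates at 1/r, and that the sensitivity is x = 1 - r f. The numerical values are illustrative only.

```python
import math


def firing_rate(current, i_threshold, r, beta):
    """Logistic mean firing rate: 1 / (r * (1 + exp(-beta * (i/i_threshold - 1)))).

    The rate is 1/(2r) when the current equals the (assumed) threshold and
    approaches the ceiling 1/r for currents far above it.
    """
    return 1.0 / (r * (1.0 + math.exp(-beta * (current / i_threshold - 1.0))))


def sensitivity(rate, r):
    """Neural sensitivity x = 1 - r * f."""
    return 1.0 - r * rate


# Hypothetical parameters: r = 2 ms, beta = 5, currents in units of the threshold.
r, beta = 0.002, 5.0
rate_at_threshold = firing_rate(1.0, 1.0, r, beta)
x_at_threshold = sensitivity(rate_at_threshold, r)
```

Under these assumptions a cell driven exactly at threshold fires at half its maximum rate and has sensitivity one half; larger `beta` sharpens the transition toward a step.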


EQN(16) is an equation that is valid for quasi-stationary inputs. To have a dynamical
equation for neural responses that have non-stationary inputs, Cowan uses the following ordinary
non-linear differential equation in the $x$ variable,

[Equation (17) is missing from the source.]

This equation, which is a dynamical equation for neural responses, can also be written, with the
subsidiary conditions that $\alpha_{ij} + \alpha_{ji} = 0$ and $\alpha_{ii} = 0$, as

[Equation (18) is missing from the source, along with the definition of a coefficient that accompanies it.]

In obtaining EQN(18), Cowan neglects the second term on the left-hand side of
EQN(17), which he refers to as the damping term. Now, we are in a position to consider easily
the stationary states of the system. As is evident from EQN(18), there are three such stationary
states: $x = 0$, $x = 1$ and

[Equation (19), which defines the stationary state $q_i$, is missing from the source.]

Eventually, it is shown that the final ensemble average $\bar{x}_i$ and final time average $\langle x_i \rangle$ of the
sensitivity, $x_i$, are both equal to the stationary state, $q_i$, defined in EQN(19). This verifies the
property of ergodicity for this system: $\bar{x}_i = \langle x_i \rangle$. In fact, all ensemble averages may be replaced
by time averages [10].


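With the following transformation equations

[Equations (20) and (21) are largely illegible in the source; the surviving fragment of EQN(20) contains the
factor $(1 - x_i)$.]

the Hamiltonian, $G$, finally emerges

$$\frac{dv_i}{d\tau} = \sum_j \left( \frac{\alpha_{ij}}{\beta_i \beta_j} \right) \frac{\partial G}{\partial v_j} \tag{22}$$

$$G = \sum_i \beta_i \left[ \ln(1 + q_i \exp v_i) - q_i v_i \right]. \tag{23}$$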
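
The way the Hamiltonian of EQN(23) generates the dynamics of EQN(22) can be sketched as follows. The sketch assumes the transformation $v_i = \ln[x_i / (q_i (1 - x_i))]$, a form chosen here because it is consistent with EQN(23), not one confirmed by the text, together with the antisymmetry conditions on the coupling coefficients stated earlier.

```latex
% Assumed transformation (not confirmed by the source):
%   v_i = \ln\frac{x_i}{q_i (1 - x_i)}
%   \quad\Longleftrightarrow\quad
%   x_i = \frac{q_i e^{v_i}}{1 + q_i e^{v_i}}
\begin{align*}
\frac{\partial G}{\partial v_i}
  &= \beta_i \left[ \frac{q_i e^{v_i}}{1 + q_i e^{v_i}} - q_i \right]
   = \beta_i (x_i - q_i), \\
\frac{dv_i}{d\tau}
  &= \sum_j \frac{\alpha_{ij}}{\beta_i \beta_j}\, \beta_j (x_j - q_j)
   = \frac{1}{\beta_i} \sum_j \alpha_{ij} (x_j - q_j), \\
\frac{dG}{d\tau}
  &= \sum_i \frac{\partial G}{\partial v_i} \frac{dv_i}{d\tau}
   = \sum_{i,j} \alpha_{ij} (x_i - q_i)(x_j - q_j) = 0 .
\end{align*}
```

The last sum vanishes because the coupling coefficients are antisymmetric, so under these assumptions $G$ is conserved along trajectories. This is the sense in which it plays the role of a Hamiltonian and supplies the integral of motion noted by Kerner.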

The “labor” is essentially completed with the acquisition of the system Hamiltonian; the
subsequent statistical mechanics is straightforward. If the corresponding canonical density is defined
as

[Equation (24) is largely illegible in the source; the surviving fragment has the form of a constant prefactor
multiplying an exponential.]

then appropriate ensemble averages for the system may be considered. Whereas Cowan
calculates these explicitly, we will merely reference a few of the significant results for demonstrative
purposes.


One important parameter to derive is the temperature, defined by Kerner in a previous
work as: a measure, in one number common to all the species, specifying the mean square
deviations of the $N_r$ (population number) from their average values [11]. Given that

$$\frac{\partial G}{\partial v_i} = \beta_i (x_i - q_i), \tag{25}$$

and

$$\overline{v_i \frac{\partial G}{\partial v_i}} = \frac{1}{\beta}, \tag{26}$$

it can be shown that

[Equation (27) is missing from the source.]

and

$$\overline{(x_i - q_i)^2} = \frac{q_i (1 - q_i)}{1 + \beta \beta_i}. \tag{28}$$

Solving for the quantity $(1/\beta)$, which he sets equal to $\theta$, Cowan gets the temperature or kinetic
energy of the net, which is equipartitioned, or distributed evenly, throughout the net. Thus,

$$\theta = \frac{\beta_i\, \overline{(x_i - q_i)^2}}{q_i (1 - q_i) - \overline{(x_i - q_i)^2}}. \tag{29}$$
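
A small numerical sketch of the temperature relation follows. It assumes that EQN(29) reads theta = beta_i * msd / (q_i * (1 - q_i) - msd), where msd is the mean square deviation of the sensitivity from its stationary value; this is a reconstruction, since the printed equation is only partly legible. The function name and the sample values are hypothetical.

```python
import math


def temperature(samples, q, beta_i):
    """Estimate theta from sensitivity samples using the assumed form of EQN(29)."""
    msd = sum((x - q) ** 2 for x in samples) / len(samples)
    return beta_i * msd / (q * (1.0 - q) - msd)


# Build a two-point sample whose mean is q and whose mean square deviation is
# q(1 - q) / (1 + beta_i / theta), then check that theta is recovered.
q, beta_i, theta = 0.3, 2.0, 0.5
spread = math.sqrt(q * (1.0 - q) / (1.0 + beta_i / theta))
samples = [q - spread, q + spread]
estimate = temperature(samples, q, beta_i)
```

Read this way, the temperature grows with the amplitude of the fluctuations of the sensitivity about its stationary value, and it diverges as the mean square deviation approaches its ceiling q(1 - q).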


If the Hamiltonian, $G$, is seen as the activity of the network, then this temperature term is
considered to be the amplitude of fluctuation of the activity. To complete the interpretation of
the canonical density, we note that, by definition

[Equation (30) is only partly legible in the source: an integral of the exponential canonical factor over
$dv_1\, dv_2 \cdots$, each variable running from $-\infty$ to $+\infty$, is written as a product over $j$ of single integrals.]

with $B(p, q)$ representing Euler’s $\beta$-function.


From this canonical density, Cowan proceeds to derive the moments of the $\beta$-density,
the mean, the variance, the time spent above and below the mean, the average crossing-rate at
which neural fluctuations occur, and the cross-correlation functions of the joint activity of two
or more cells. Finally, he gives some practical applications with regard to the thalamo-cortical
interaction, which he cites in proof of his work.

A few comments require mention at this point. The mathematics that follows
from the dynamics of Cowan, as outlined here, carries three implicit assumptions which deserve
attention. First, as introduced earlier, there is the matter of the single or impulsive vs.
packets-of-pulses or bursting nature of the action potential, as seen by referencing EQN(3). In addition to
the supposition of a single-pulse character for action potentials, Cowan makes another assumption
not cleanly supported by biological knowledge. There is no accounting in Cowan’s kinematics
for variable delay times, as he considers the transmission of signal from neuron to neuron to be
instantaneous. Finally, cf. EQN(18), Cowan models tonic cells, which give maintained responses
to constant stimuli, and not phasic cells, which respond only to changing stimuli [12].

The last of these issues was to be addressed by Cowan subsequently and will not
be the focus of this intended investigation. The former two concerns, that of “bursting” and
“transmission delays,” as we shall call the two phenomena, will be pursued presently.


2.  Non-Equilibrium  Models: 

As indicated previously, alterations that we make to Cowan’s model lead eventually to an analysis
that requires more sophistication in the mathematical techniques employed. Although our initial
analysis remains at the level of equilibrium statistical mechanics, we include it in this section
to emphasize the direction that our mathematics will ultimately take. Fortunately, many of the
tools available in non-equilibrium statistical mechanical theory equip us well for the mathematical
intricacy that evolves from our approach.


2.1 - Our Corrections to Cowan’s SMN Model

Remediation of Cowan’s assumption of Heaviside-Dirac pulses and neuron-neuron
communication, together with his omission of variable transmission delay time between neurons, provides us
a working goal. Toward this end, we will make our own set of postulates that we feel lead to
more biologically accurate models. Of these, we feel our most critical alteration is the idea that
the introduction of a logistic function into the dynamics be postponed until after the definition
of the operational kinematics of the individual neurons. This makes the entire derivation less
heuristic and therefore more exact. To see this quantitatively, let us compare the derivation of
Cowan, and in particular EQN(9) and EQN(17), with the following, which we propose.


Figure 2: Model for the Neuron as a Summator.

We begin, as did Cowan, with a model of the neuron as, essentially, a summator
(see Figure 2): it takes the collection of combined inhibitory and excitatory signals that impinge
upon its cell body, or soma, from all antecedent cells (a process known as convergence), adds
them together by integration, and transmits the singular result through its axon to all
sequential cells (divergence) [13]. We perceive a situation where the details of the manner in which
integration and cell thresholding occur are critical.

Considering the cell body membrane system to be linear, the membrane potential
comparable to EQN(4) in our scheme is

$$v_m(t) = \sum_j \int_{-\infty}^{+\infty} g(t - \tau)\, i_j(\tau)\, d\tau \tag{31}$$

where for analytical work

$$g(t - \tau) = \exp[-\gamma (t - \tau)] \tag{32}$$

Here we are representing actual action potential integration rather than following Cowan, who
used a firing rate relationship based on the assumption of closely spaced Heaviside-Dirac pulses.

The cell body releases the action potential once the membrane potential reaches a critical
value, $v_c$, changing the cell’s conductivity to ions, which respond by moving through the prescribed
gates in the membrane and as a result change the potential. If we let $C(t)$ be the concentration
of ions in the cell at a given time

$$\frac{dC}{dt} = -\{k_0 + k_1 H[v_m(t) - v_c]\}[C(t) - C_e] - h[C(t) - C_r], \tag{33}$$

with $k_0$, $k_1$, and $h$ being positive coefficients and with $H$ being the Heaviside Step Function,
estimated for analytical work by

$$H[v_m(t) - v_c] = \frac{1}{1 + \exp\{-\beta[v_m(t) - v_c]\}}. \tag{34}$$

Whenever $v_m(t)$ is less than or greater than $v_c$, $C(t)$ becomes $C_r$ or $C_e$ respectively.

With this prescription, action potential-like behavior is obtained since the change in concentration
of the ions in the cell is seen to be the membrane current, $i_m(t)$. The following equations

$$\frac{dv_m}{dt} + \gamma v_m = \sum_j i_j(t) \tag{35}$$

[Equation (36) is missing from the source.]

$$\frac{dC}{dt} + \{k_0 + k_1 (1 + \exp\{-\beta[v_m(t) - v_c]\})^{-1}\}[C - C_e] + h[C - C_r] = 0 \tag{37}$$

define a system of equations resulting from the preceding set of assumptions. These may be
manipulated further realizing that for various neurons $\beta$, $v_c$, $k_0$, $k_1$, $h$, $C_e$, and $C_r$ may vary
statistically.

As can be seen, this approach is more profitable than Cowan’s in that the foundation
that had been developed heuristically, EQN(1), has now been made more rigorous. Our EQN(33)
is not ad hoc and specifies more precisely the response of a realistic neuronal membrane. Now,
Cowan’s hypothesis of pulse summation at the neuron and logistic rate relation can be tested
from more fundamental starting points.
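
The proposed system can be explored numerically with a sketch such as the one below. It assumes that the potential obeys dv_m/dt + gamma * v_m = i(t), that the concentration obeys EQN(33), and that the Heaviside step is replaced by its logistic estimate. The function names, the on-then-off input current, and all parameter values are hypothetical choices made only to show the threshold behavior.

```python
import math


def soft_step(v, v_c, beta):
    """Logistic estimate of the Heaviside step H[v - v_c]."""
    return 1.0 / (1.0 + math.exp(-beta * (v - v_c)))


def simulate_neuron(inputs, dt, gamma, v_c, beta, k0, k1, h, c_e, c_r):
    """Forward-Euler integration of the potential and ion-concentration equations.

    inputs holds the total input current at each time step. Returns the lists
    of membrane potential values and concentration values, including the
    initial state (v = 0, C = c_r).
    """
    v, c = 0.0, c_r
    vs, cs = [v], [c]
    for i_t in inputs:
        dv = i_t - gamma * v
        gate = k0 + k1 * soft_step(v, v_c, beta)
        dc = -gate * (c - c_e) - h * (c - c_r)
        v += dt * dv
        c += dt * dc
        vs.append(v)
        cs.append(c)
    return vs, cs


# Hypothetical run: input on for 10 time units, then off for 10 time units.
drive = [2.0] * 1000 + [0.0] * 1000
vs, cs = simulate_neuron(drive, dt=0.01, gamma=1.0, v_c=1.0, beta=20.0,
                         k0=0.01, k1=10.0, h=1.0, c_e=1.0, c_r=0.0)
```

In this run the concentration stays near `c_r` until the potential crosses `v_c`, climbs toward `c_e` while the input holds the potential above threshold, and relaxes back once the input is removed. Letting the parameters vary from neuron to neuron, as the text suggests, would amount to drawing them from distributions before each run.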


2.2 - Future Directions

There remains the as yet unexplained phenomenon of “bursting” that demands to be addressed.
The all-or-nothing description of the emission of pulses is understood to apply to the emission of
single pulses and is therefore an oversimplification of the true process. In reality, neurons emit
pulses continually, and it is the frequency of pulse emission that increases significantly when the
neuron becomes activated. It is this sequence of single pulses (or bursts) in quick succession, or
rather the frequency of these bursts, that is the unit of interest [14].

To quantify this situation, we look to the discovery of Weber, who noticed that the
smallest perceptible difference in intensity, $\Delta I$, is a constant fraction of the intensity $I$, called the
Weber Constant, equal to $\Delta I / I$. To this contribution, Fechner added a relationship between
“stimulus” and “sensation” and derived the Weber-Fechner law

$$\text{Sensation} = K \log I + C \tag{38}$$

stating that discriminable units are increments of sensation. Thus, there is a logarithmic encoding
of intensities being performed in cells [15]. An important distinction becomes notable when
extending these models of tonic cells to include phasic cells, another important question for a
future study. Whereas for tonic receptors this law, EQN(38), is a fair approximation, for phasic
receptors it does not hold. The discharge is limited by prompt adaptation [16].

The inclusion of phasic cells in the statistical mechanical model is but one of the potential
areas opened up to investigation by a more complete understanding of our work. Table 1 below,
from Neural Network Architectures by Judith Dayhoff, lists several more places where further
study is needed.


Biological Neural Networks                   Artificial Neural Networks
-------------------------------------------  ----------------------------------------------
Synapses complex                             Synapses simple
Fixed gross wiring structure plus            Usually fully interconnected slabs
  variation in detailed structure
Pulse transmission                           Activity value and connection strength
Topological mappings                         Kohonen feature map
Distributed representations and processing   Distributed representations and processing
Redundancy                                   Redundancy
Feature detectors                            Feature detectors
Learning as fast as one pass                 Slow to converge
100 billion neurons                          Usually up to hundreds or thousands of neurons
Estimated 10,000 interconnections            Usually 10 to 10,000 interconnects per neuron
  per neuron
Continuous or asynchronous updating          Generally synchronous updating

Table 1: A Comparison of Biological and Artificial Neural Networks. (Source: Dayhoff, J.E. 1990, Neural
Network Architectures: An Introduction, Van Nostrand Reinhold: New York, p. 158.)

This table collects a few of the differences between artificial and biological neural
networks and provides great incentive for those attempting to increase the biological accuracy of
current neural network models. The difficulty lies in retaining mathematical tractability with
the increase in mathematical complexity, but the reward is invaluable.

Another important direction to be pursued includes the explanation of various features of
the active state of the brain, the active EEG. The Visual (VER) and Auditory Evoked Responses
(AER) are phenomena that occur in reaction to external stimuli independent of the resting
state and are prime candidates for necessitating the introduction of non-equilibrium statistical
mechanics. Both small and large perturbations from equilibrium could be explored, invoking
Kubo theory and Fokker-Planck formulations to indicate how the system responds to input and how
it handles input-output relations. In short, the VER and AER are problems in non-equilibrium
statistical mechanics of neural nets [17].

2.3  -  Applications  to  Health  and  Safety 

In the endeavor to explain the resting EEG, the development of neural network models that
are more biologically-accurate is crucial. If we can train these “biophysical neural networks” to
correctly predict the electric fields coming from cells in the brain while it is in its resting state,
then we will eventually be able to decipher more easily the electric field emissions from the brain
in its active state. This knowledge will assist with medical diagnostic EEG analysis and should
permit consideration of the effects of applied external fields on the resting and active EEG.
Knowing with certainty the expected electric fields of the resting or active brain independent of
external effects is vital in isolating the effects of external fields, since the external fields may be
simply superimposed over endogenous membrane potentials.

The  level  of  danger  presented  to  human  beings  by  low-level  stray  electric  fields  in  the 
environment  is  unknown  at  present.  Ultimately,  showing  and  explaining  the  interaction  of  the 
brain’s  electric  field  with  applied  fields  is  our  interest.  To  determine  to  what  degree  these  artificial 
environmental  fields  effect  damage  to  learning  and  the  degradation  of  memory  is  an  important 
occupational  and  environmental  medical  pursuit,  one  that  our  proposed  work  is  designed  to 
advance. 


Conclusion: 

Neawly  7  decades  has  elapsed  since  the  introduction  of  the  suggestion  by  Lotka  (1925)  to  encor- 
porate  statistical  mechanics  rigorously  into  our  theories  of  biological  and  neural  modeling,  and 
yet  there  remains  a  multitude  of  unpursued  avenues  and  unanswered  questions  with  regard  to 
this  pursuit.  The  work  of  Volterra  (1931,  1937),  Rashevsky  (1938),  Kerner  (1957,  1959,  1961, 
1964),  and  Cowan  (1970)  has  laid  the  groundwork.  We  are  in  a  postion  to  more  fully  answer 
questions  about  the  human  brain  from  an  important  perspective.  It  is  our  ultimate  hope  to 
thereby  illuminate  some  of  the  mystery  surrounding  the  risk  to  man  posed  by  external  factors  in 
our  environment. 

Abstract

The mathematical basis of the multivariate risk screening procedure is
that, although multiple items of data originating from the same source add no
more to the modification of the established odds than a single datum from that
source, multiple truly independent parameters do improve
success. Currently, a quantitative biochemical alpha-fetoprotein procedure is
used. However, according to the Foundation for Blood Research (FBR), it is
far less accurate than the proposed AFP diagnostic profile procedure, which
tests not only quantities of Maternal Serum Alpha-Fetoprotein (MSAFP) but
also unconjugated estriol (uE3) and human chorionic gonadotropin (hCG).
On the other hand, the new procedure has not been approved by the ACOG
(American College of Obstetricians and Gynecologists), even though the FBR
asserts that the new profile test is three times more accurate in predicting
Down Syndrome.

My  mission  at  Brooks  Air  Force  Base,  San  Antonio,  Texas,  in  the 
Epidemiologic  Research  Division,  was  to  reinterpret  the  multivariate  algorithm 
submitted  by  the  Foundation  for  Blood  Research  (FBR)  so  that  the  new  screening 
procedure  could  be  implemented  into  Armstrong's  existing  laboratory  data  base. 

ASSESSING THE IMPLEMENTATION OF A
MULTIVARIATE RISK SCREENING PROCEDURE
IN THE DIAGNOSING OF DOWN SYNDROME ASSOCIATED PREGNANCY (DSAP)


WILLIAM R. FLETCHER, JR.


INTRODUCTION:

Down Syndrome (DS) is a variant of nature. As defined by the Encyclopedia
Britannica, "Down Syndrome was the first autosomal abnormality to be described
in man. All persons afflicted with this disorder have all (or most) of the
set of chromosomes conventionally labeled 21 represented triply rather than
doubly, so that there are 47, rather than the normal 46, chromosomes in most
cells." (See Fig. 1).

"As the most common autosomal aneuploidy (incidence at birth = 1.3 per 1,000),
Down  Syndrome  has  a  significant  survival  rate  beyond  infancy.  The  disorder 
has  been  identified  in  all  races  (the  term  mongolism  was  used  because  the  up- 
slanted  eyes  and  epicanthal  folds  gave  the  appearance  of  an  Oriental  face)  and 
usually  occurs  among  the  firstborn  of  women  over  40  years  of  age.  More  than 
half  of  the  affected  children  die  during  their  first  year,  usually  from  heart 
defects  or  from  infection.  Life  span,  if  the  infant  survives  the  first  year, 
may  be  relatively  normal,  although  sinus  and  lung  infections  are  a  recurrent 
hazard.  Severe  mental  retardation  requires  special  training,  and  individuals 
so  affected  do  not  ordinarily  become  self-supporting;  they  are,  nevertheless, 
unusually  sociable  and  affectionate."  (1) 

"Prenatal diagnosis is made by examination of fetal or placental
chromosomes from sampling amniotic fluid (amniocentesis), from biopsy of the
placenta [chorionic villi sampling (CVS)], and, less frequently, umbilical
cord blood sampling. CVS is usually performed between 8 weeks and 12 weeks
gestational age (G.A.); amniocentesis, 12 weeks G.A. Complications (e.g.,
fetal death) can occur from these invasive procedures; however, risk is
relatively low in the U.S. (e.g., number of fetal deaths from amniocentesis =
approximately 0.2% to 0.4%). Nevertheless, these procedures are performed
when the risk of Down Syndrome or other abnormality is sufficiently high to
justify them. Definition of sufficiently high risk is subjective and in 1983,
the President's Commission for the Study of Ethical Problems in Medicine and
Biomedical and Behavioral Sciences recommended that genetic amniocentesis be
made available to all pregnant women. In practice, due to cost (approximately
$1,000), labor-intensity, and technical (e.g., shortage of cytotechnologists)
constraints, most physicians recommend amniocentesis for those women with DS
risk at term > 1:365 [1:386; the risk at maternal age (M.A.) = 35 years], or
2nd trimester risk > 1:270 (1:2744). Thus, non-invasive, biochemical screening
tests [e.g., maternal serum alpha-fetoprotein (MSAFP), human chorionic
gonadotropin (hCG), unconjugated estriol (uE3), pregnancy-specific
β1-glycoprotein (SP1)] that can provide better assessment of Down Syndrome risk
than M.A. alone are desirable by the medical community." (2)

"The  AFP  Profile  or  alpha-fetoprotein-triple  marker  procedure  (AFP-TMP)  is  a 
group  of  tests,  performed  on  a  single  maternal  serum  sample,  which  will  screen 
for  open  fetal  defects,  such  as  spina  bifida,  anencephaly,  and  abdominal  wall 
defects,  and  also  for  fetal  trisomy  21  (Down  Syndrome).  Along  with  maternal 
serum  alpha-fetoprotein  (MSAFP),  two  additional  biochemical  markers, 
unconjugated  estriol  (uE3)  and  human  chorionic  gonadotropin  (hCG),  are 
measured  to  calculate  an  increase  or  decrease  in  the  patient's  age-related 
risk  for  Down  Syndrome.  Compared  to  using  age  35  alone  as  an  indication  of 
risk,  or  low  MSAFP  and  age  alone  in  the  under  35  population,  the  AFP  Profile 
can  triple  the  number  of  cases  of  fetal  Down  Syndrome  detected  (from  about  20% 
to 60%) following amniocentesis on about 5% of screened women.

The  AFP-TMP  is  just  as  reliable  as  routine  MSAFP  screening  for  identifying 
open  neural  tube  defects;  the  MSAFP  level  is  still  used  by  itself  to  determine 
risk  of  open  fetal  defects,  and  the  protocols  and  recommendations  associated 
with  elevated  MSAFP  and  open  fetal  defects  are  unchanged.  The  additional 
markers  -  uE3  and  hCG  -  significantly  enhance  the  sensitivity  of  Down  Syndrome 
screening."  (3) 

As  reported  by  Dr  J.  Canick,  "with  the  discovery  that  pregnancies  affected 
with  Down  Syndrome  are  associated  with  lower  levels  of  MSAFP,  we  postulated 
that  the  levels  of  other  biochemical  markers  of  fetoplacental  origin  might 
also  be  abnormal  in  Down  Syndrome  pregnancy.  Alpha-fetoprotein  (AFP)  is  made 
primarily  in  the  fetal  liver  during  the  second  trimester  of  pregnancy  and 
indirect  data  indicated  that  the  livers  of  fetuses  affected  with  Down  Syndrome 
might  be  producing  less  than  normal  amounts  of  AFP.  The  fetal  liver  also 
participates  in  the  synthesis  of  the  fetoplacental  steroid  hormone,  estriol. 
Therefore, maternal serum unconjugated estriol (uE3) might also be lower than
normal  in  pregnancies  affected  with  fetal  Down  Syndrome  and  might  also  provide 
useful  clinical  information  for  prenatal  screening.  A  research  collaboration 
between  Women  &  Infants  Hospital,  the  Foundation  for  Blood  Research,  and  the 
Medical  College  of  St.  Bartholomew's  Hospital  showed  that  maternal  serum  uE3 
is,  in  fact,  low  in  Down  Syndrome  pregnancy  and  that  the  measurement  of  uE3 
along with AFP substantially enhances screening sensitivity." (4)

"Researchers  at  the  University  of  California,  San  Diego,  demonstrated  at  about 
the  same  time  that  maternal  serum  levels  of  the  placental  hormone,  human 
chorionic  gonadotropin  (HCG),  were  also  abnormal  in  pregnancies  affected  with 
fetal  Down  Syndrome  (3).  Interestingly,  HCG  levels  were  higher  than  normal  in 
such  cases.  Immediately  following  this  discovery,  we  were  able  to  demonstrate 
that  the  measurement  of  maternal  serum  HCG  in  conjunction  with  uE3  and  AFP 
could  better  define  the  patient-specific  risk  for  fetal  Down  syndrome  than  any 
method  previously  described."  (5) 

DISCUSSION  OF  PROBLEM 


The problem at Brooks Air Force Base's Epidemiologic Research Division
(AOE) is also one of variance. Requests for the AFP triple-marker system
have been increasing significantly, and AOE's Clinical Chemistry Laboratory has
been considering a change to the proposed system for at least 6 months. On
the other hand, because of a shortage of manpower and managerial transition
(NCOIC, as well as OIC), very little time could be allotted to the necessary
research. Moreover, AOE had no existing system that would actively
involve the OB/GYN physicians from Wilford Hall USAF Medical Center in the
implementation of the triple-marker AFP profile. Yet, having the reputation
of being a state-of-the-art reference laboratory, AOE could ill afford to
overlook the claims of various biomedical scientists that the AFP
triple-marker is 50-300 percent more accurate in predicting Down Syndrome than the
current Maternal Age (MA)-MSAFP diagnostic system.

Cost is another consideration. AOE is on a "tight" budget, so
the cost of such a system might be prohibitive. Feasibility is a further
question: is the instrumentation readily available, and could the
software for the triple-marker system be written in house, utilizing a mathematician?
Finally, what variances would be considered in the system redesign?

METHODOLOGY 


The  variances  in  this  case  are  caused  by  AOE’s  socio-technical 
environment.  (See  Fig.  2).  Indeed,  the  emergence  of  AOE's  desire  to  meet  the 
demand  for  the  triple-marker  profile  is  a  typical  case  of  change  being  brought 

Analysis of the sociological forces bringing about AOE's need to change its
current MSAFP system:

*  More  military  and/or  professional  women  are  having  children  at  a  later  age. 

* Stiffer abortion laws have increased the patient's need-to-know (as to
whether or not her baby will have DS and, if so, when and where she should
have an abortion, if she makes such a choice).

*  As  a  result,  in  order  to  escape  legal  entanglement,  more  military  doctors 
are  requesting  the  triple-marker  AFP  profile  for  DS. 

* Like any other top international, U.S.-based corporation, the USAF depends
heavily  on  "leading  edge  technology."  Consequently,  in  keeping  with  the  USAF 
"image,"  AOE  must  continue  to  show  that,  like  its  civilian  counterparts,  it 
too  can  provide  state-of-the-art  medical  information,  quickly,  efficiently, 
but  at  a  lower  economic  cost. 

*  Indeed,  the  Japanese  have  shown  that,  in  order  to  be  a  competitive  "world- 
player,"  organizational  group  participation  is  a  must.  AOE  seems  to  be 
courting  this  concept  in  its  effort  to  involve  the  Wilford  Hall  OB/GYN  medical 
group with the triple marker program. By involving the physicians, as well
as possibly getting their business, AOE will further entrench its
existence with the "new" Air Force.

Analysis of the technical forces bringing about AOE's need to change its current
MSAFP system:

*  Until  biochemical  screening  became  available,  the  most  frequent  method  for 
selecting  women  at  increased  risk  for  DS  was  maternal  age. 

* In 1984, AFP was reported to be lower than average in maternal serum
associated  with  DS  pregnancy.  The  discovery  was  found  to  be  independent  of 
maternal  age. 

* In 1987, Bogart and colleagues at the University of California, San Diego,
reported  an  association  between  abnormal  human  chorionic  gonadotropin  (hCG) 
levels  and  pregnancies  with  fetal  aneuploidy. 

*  "Furthermore,  in  1987,  it  was  discovered  that  the  fetal  liver  also 
participates  in  the  synthesis  of  the  fetoplacental  steroid  hormone,  estriol 
(uE3).  Therefore,  maternal  serum  uE3  might  also  be  lower  than  average  in 
pregnancies  affected  with  fetal  Down  Syndrome  and  might  also  provide  useful 
clinical  information  for  prenatal  screening.  In  collaboration  with  the 
Foundation  for  Blood  Research  in  Scarborough,  Maine,  and  the  Medical  College 
of  St.  Bartholomew’s  Hospital  in  London,  the  WIH  group  showed  that  maternal 
serum  uE3  is,  in  fact,  low  and  that  the  measurement  of  uE3  along  with  AFP 
substantially  enhances  screening  sensitivity."  (6) 

* "Again in 1987, several groups described methods of screening pregnant women
for the risk of carrying a fetus with Down Syndrome; the screens utilized
maternal age at expected date of delivery (EDD) and the measurement of
maternal serum α-fetoprotein (mAFP). Cuckle et al. calculated that applying
such a screening procedure, with all pregnancies having a risk greater than
1:250 receiving amniocentesis, would correctly detect 28% of affected
pregnancies with 2.6% of non-affected pregnancies being detected as false
positives.  Although  only  a  small  proportion  of  Down’s  associated  pregnancies 
could  be  detected  with  an  acceptable  false  positive  rate,  this  did  represent 
an  improvement  over  methods  of  screening  not  including  maternal  age  within  the 
risk  calculation."  (7) 

*  "In  1988  Wald  published  a  further  refinement  of  the  method  of  screening 
using  maternal  age  and  three  analytes:  mAFP,  human  chorionic  gonadotrophin 
(HCG) and serum unconjugated oestriol (uE3). By selecting decision limits
such that all women with a risk greater than 1:250 would be offered
amniocentesis, it was demonstrated that a detection rate in excess of 60% of
Down's associated pregnancies was possible whilst maintaining an overall
amniocentesis rate of about 5%, a rate no higher than that currently achieved
by centers screening for Down Syndrome solely on the basis of maternal age
36 years or over." (8)

RESULTS:

As a result, AOE had to become a "risk-taker." Sociological and
technological advances have brought about the emergence of an AFP-Triple Marker
Profile (AFP-TMP). With the reputation of being an "advanced" reference
laboratory, AOE could not afford to overlook a procedure that is,
experimentally, 50-300% more accurate in predicting Down Syndrome Associated
Pregnancies (DSAP) than the current Maternal Age (MA)-MSAFP diagnostic system.
Thus, an initial understanding of the mathematics behind the AFP-TMP was
necessary.


DEFINITIONS

Risk ratio (or odds ratio): A ratio of 1:n implies
that for n trials there is 1 chance of a specified
event occurring.

Odds modifier (or likelihood ratio): This is a
factor which is used to modify a risk ratio. For
example if a risk ratio of 1:20 is modified by an
odds modifier of 3 the modified risk ratio is 1:60.

Increase in risk: Increased risk means that
there is a greater probability that an event will
occur. For example a risk of 1:200 is greater than
a risk of 1:300.

Decrease in risk: From the above a risk of
1:300 is less than a risk of 1:200.


Fig. 3 Matrix mathematics

What is a matrix?

A matrix is a group of numbers in a specific
format which interacts with other such sets of
numbers in a very specific way. Matrices are
formed from a number of rows and columns. For
a multivariate Gaussian distribution the core of
the formula is the matrix V which has the same
number of rows as columns.

Matrix notation:

$\underline{X}$: The line beneath the variable name signifies
that it is in matrix format. $\underline{X}$ is the
matrix containing all the patient's
parameters. A second matrix contains the
population means.

$V^{-1}$: This is the inverse of the matrix V. This
concept will be explained later.

$|V^{-1}|$: This is the determinant of a matrix. This
will be explained later.


"The  risk  of  Down  Syndrome  in  relation  to  maternal  age  at  EDD  has  been 
calculated  by  Cuckle.  This  risk  is  expressed  as  an  odds  ratio.  Information 
concerning  the  progress  of  the  pregnancy  independent  of  maternal  age  is  then 
used  to  modify  this  ratio.  Providing  the  Gaussian  distributions  for  a  given 
parameter  are  known  for  both  unaffected  and  Down  Syndrome  pregnancies,  then 
for  a  given  parameter  value  the  chance  of  a  Down  Syndrome  associated  pregnancy 
is  the  ratio  of  the  heights  of  the  Gaussian  distribution  for  Down  Syndrome 
pregnancies  and  unaffected  pregnancies,  respectively:  this  has  been  termed  the 
likelihood  ratio  by  Wald.  It  may  also  be  called  the  odds  modifier;  i.e., 
calculation  of  age  related  risk: 

The  formula  for  calculation  of  maternal  age  related  risk  is: 

$$\text{Risk} = \frac{1 - \left(0.000627 + \exp(-16.2395 + 0.286 \times \text{Maternal Age})\right)}{0.000627 + \exp(-16.2395 + 0.286 \times \text{Maternal Age})}$$


This formula gives a value for n such that Risk = 1:n.
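
The age-related risk formula can be sketched in a few lines of Python. This is a minimal illustration, assuming the formula is read as the odds (1 - p)/p with p = 0.000627 + exp(-16.2395 + 0.286 x maternal age), which is how the coefficients in the source appear to fit together. The function name `age_related_risk_n` is hypothetical.

```python
import math


def age_related_risk_n(maternal_age: float) -> float:
    """Return n such that the age-related risk is 1:n.

    Assumption: p is the age-related probability and n is the odds (1 - p) / p.
    """
    p = 0.000627 + math.exp(-16.2395 + 0.286 * maternal_age)
    return (1.0 - p) / p


# Print the age-related risk for a few maternal ages at EDD.
for age in (25, 30, 35, 40):
    print(f"maternal age {age}: risk of about 1:{age_related_risk_n(age):.0f}")
```

Under this reading, a maternal age of 35 gives a value of n in the high 300s, which is in line with the term risk at age 35 quoted in the introduction.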

Data required for screening test: parameters used by Wald et al. were log10 of
the multiple of the median for mAFP, HCG and the multiple of the median for
uE3. Depending on the number of parameters, the appropriate Gaussian
distribution function should be used. For one or two parameters the functions
may be entered directly. For three parameters there is some precalculation
required. The matrix V, its inverse and the determinant of the inverse must
be  evaluated  for  both  normal  and  Down's  associated  pregnancies.  They  need 
only  be  calculated  once  as  they  may  then  be  stored  in  the  computer  as 
constants.  The  V  matrix  is  composed  of  the  population  standard  deviation 
values  for  each  parameter  and  the  correlation  coefficients  between  each 
parameter."  (9) 

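$$V = \begin{pmatrix}
\sigma_1^2 & \rho_{12}\sigma_1\sigma_2 & \rho_{13}\sigma_1\sigma_3 \\
\rho_{12}\sigma_1\sigma_2 & \sigma_2^2 & \rho_{23}\sigma_2\sigma_3 \\
\rho_{13}\sigma_1\sigma_3 & \rho_{23}\sigma_2\sigma_3 & \sigma_3^2
\end{pmatrix}$$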

"For  example  the  second  element  of  the  top  row  of  the  illustrated  matrix  is 
the  product  of  the  correlation  coefficient  relating  parameters  1  and  2  with 
the  standard  deviations  of  parameter  1  and  2. 
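
The construction of V described above can be sketched as follows. The standard deviations and correlation coefficients used here are hypothetical illustration values, not the published population parameters, and the helper name `covariance_matrix` is an assumption of this sketch.

```python
def covariance_matrix(sds, corr):
    """Build V with V[i][j] = corr[i][j] * sds[i] * sds[j].

    The diagonal of corr is 1, so V[i][i] is the variance sds[i] ** 2.
    """
    n = len(sds)
    return [[corr[i][j] * sds[i] * sds[j] for j in range(n)] for i in range(n)]


# Hypothetical standard deviations for parameters 1, 2 and 3.
sds = [0.20, 0.25, 0.15]
# Hypothetical correlation coefficients between the parameters.
corr = [
    [1.0, 0.2, 0.1],
    [0.2, 1.0, -0.1],
    [0.1, -0.1, 1.0],
]

V = covariance_matrix(sds, corr)
for row in V:
    print(["%.4f" % value for value in row])
```

The second element of the top row is the correlation between parameters 1 and 2 multiplied by both standard deviations, as the quoted text describes. Because the correlation matrix is symmetric, V is symmetric as well.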


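The patients' data are entered into the normal distribution function twice.
Once using population data derived from Down's associated pregnancies, and
once using the data from normal pregnancies. These two values,
$\text{height}_{\text{downs}}$ and $\text{height}_{\text{normal}}$, are used to
calculate the odds modifier:

$$\text{odds modifier} = \frac{\text{height}_{\text{normal}}}{\text{height}_{\text{downs}}}$$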

The final risk is the product of the age related risk and the odds modifier.
Thus in the case of screening for Down's associated pregnancy the risk would
be:

$$\text{Risk} = \text{Age-related risk} \times \frac{\text{height}_{\text{normal}}}{\text{height}_{\text{downs}}}$$

(10)
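
The three-parameter calculation can be sketched end to end in plain Python. This is an illustrative sketch only: the means and the covariance matrix below are hypothetical, the same V is assumed for both populations to keep the example short, and the function names are not taken from the source. The density is normalized with the square root of the determinant of V, which is equivalent to using the determinant of the inverse as the quoted text describes.

```python
import math


def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def inv3(m):
    """Inverse of a 3x3 matrix: adjugate divided by the determinant."""
    (a, b, c), (d, e, f), (g, h, i) = m
    det = det3(m)
    adj = [
        [e * i - f * h, c * h - b * i, b * f - c * e],
        [f * g - d * i, a * i - c * g, c * d - a * f],
        [d * h - e * g, b * g - a * h, a * e - b * d],
    ]
    return [[value / det for value in row] for row in adj]


def gaussian_height(x, means, V):
    """Height of the trivariate Gaussian density at the patient values x."""
    diff = [xi - mi for xi, mi in zip(x, means)]
    v_inv = inv3(V)
    q = sum(diff[r] * v_inv[r][c] * diff[c] for r in range(3) for c in range(3))
    return math.exp(-0.5 * q) / ((2 * math.pi) ** 1.5 * math.sqrt(det3(V)))


def final_risk_n(age_n, x, normal, downs):
    """Return n for the final risk 1:n: age-related n times the odds modifier."""
    odds_modifier = gaussian_height(x, *normal) / gaussian_height(x, *downs)
    return age_n * odds_modifier


# Hypothetical population parameters (means, V) for the three markers.
V_SHARED = [
    [0.0400, 0.0050, 0.0020],
    [0.0050, 0.0625, -0.0010],
    [0.0020, -0.0010, 0.0225],
]
NORMAL = ([0.0, 0.0, 0.0], V_SHARED)
DOWNS = ([-0.14, -0.14, 0.30], V_SHARED)

patient = [-0.10, -0.10, 0.20]  # hypothetical transformed marker values
print(f"final risk of about 1:{final_risk_n(250.0, patient, NORMAL, DOWNS):.0f}")
```

Since V, its inverse and its determinant depend only on the population data, a production version would compute them once and store them as constants, as the quoted passage suggests.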

"Whereas  the  estimation  of  racing  odds  is  based  on  subjective  qualitative 
information  and  the  computations  are  intuitive,  based  on  experience,  the 
numerical  data  obtained  from  biochemical  measurement,  or  other  numeric 
parameters  of  pregnancy  influenced  by  Down  Syndrome,  can  be  processed 
mathematically. 

The  Gaussian  distribution  of  a  single  variable  is  visually  presentable  (in  two 
dimensions),  and  the  related  Gaussian  distributions  of  two  variables  are  also 
presentable  in  three  dimensions:  the  introduction  of  more  variables  results 
in  conceptual  difficulty,  which  can  be  overcome  by  mathematical  analysis  using 
matrix  algebra  and  may  be  easily  programmed  in  BASIC  on  any  commercially 
available  personal  computer. 

The  method  of  risk  calculation  is  analogous  to  setting  betting  odds  for  horses 
in  a  race.  Established  odds  (equivalent  to  odds  ratio)  are  based  on  past  form 
(equivalent to maternal age) but can be modified by further independent
information such as an observation of current form (equivalent to mAFP).
Consistent success in picking a winner requires a reliable source of
independent items of information (equivalent to HCG, uE3). As with the horse
racing analogy many items of apparently independent information, but from the
same source add no more to the modification of the established odds than a
single item of information from that source. However, multiple sources of
truly  independent  information  (especially  if  not  available  to  the  bookmaker) 
improve  success."  (11) 

On  the  other  hand,  to  successfully  accommodate  the  variations  of  its  socio- 
technical  environment,  an  organizational  redesign  of  AOE's  DSAP  procedure 
appears  necessary.  AOE's  basic  mission  is  the  providing  of  medical 
information  to  the  user.  Yet,  there  are  social  issues  attached  to  DSAP  that 
AOE  must  fully  discuss,  sooner  or  later,  with  its  users-  the  military 
physician  group,  especially  with  regard  to  the  patient's  need-to-know. 

Indeed,  the  current  MA-MSAFP  system  doesn't  seem  to  account  for  the  increasing 
social  concern  surrounding  DSAP  and  abortion  laws.  This  increase  in  the 
concern  over  DSAP  by  older  professional  civilian  and/or  military  women  and  the 
ability to have abortions, could translate into a need for more accurate DSAP
testing. Already, more than 30 commercial laboratories are utilizing the
AFP-TMP on a daily basis. On the other hand, though it is 50 percent more
sensitive than the current MA-MSAFP procedure, the AFP-TMP still misses a
significant number of DSAPs (approximately 40 percent). But then, AFP-TMP is
an indirect biochemical procedure. Since DS is an autosomal aneuploidy, there
will  probably  evolve  an  alternate  laboratory  procedure  involving  chromosomal 
genotyping  and/or  phenotyping  of  nucleated  fetal  cells  with  approximately  94- 
100  percent  accuracy.  Therefore,  AOE,  in  order  to  successfully  accommodate 
the  variations  of  its  socio-technical  environment,  must  have  an  organizational 
redesign  -  with  respect  to  DSAP  -  that  allows  for  socio-technical  change.  By 
doing  so,  AOE  will  avoid  the  trauma  and  the  inefficiency  of  crisis  management, 
as  well  as  maintain  its  reputation  as  a  state-of-the-art  facility. 

One  way  to  adjust  to  the  socio-technical  "change  process"  is  the  creation 
of  a  "Provider/User"  steering  committee  (see  Fig.  4).  This  committee  should 
meet  periodically  to: 

1.  Discuss  latest  events  in  AFP  testing. 

2. Ensure the integrity of the Air Force mission - the well being of Air
Force personnel and their dependents - by a dedicated commitment to quality with
respect to patient care. This quality can be obtained by (a) setting
"idealized goals," i.e., 100 percent forecasting for Down Syndrome and (b)
perhaps inviting participation of enlisted staff by allowing them to attend
"provider/user" steering committee meetings and offer suggestions.

3. Design a pilot program by which the AFP triple marker profile will be
quantitatively tested and assessed.

4. Oversee the interfacing of the commercial software with the Laboratory
Information Management System (LIMS), as well as how the final LIMS report
will be presented.

5. Assess the data produced from the implementation process (as to how
efficient it is).

6. Analyze new techniques and new innovations.

DSAP PROCEDURE REDESIGN:


CONCLUSION: 


We found three existing commercial software packages which perform the
AFP-TMP and interface with AOE's data base. Consequently, the mathematical
basis of multivariate risk screening was discussed in order to reaffirm,
theoretically, that the AFP-TMP is more sensitive to DSAP than the current
MA-AFP procedure (12). However, because of ever changing technical advances
(i.e., flow cytometry, DNA probes, etc.) and because of various social issues
involved with the AFP-TMP (i.e., abortion, ACOG not yet approving the
procedure [AFP-TMP] as a diagnostic standard, etc.), an Organizational
Redesign, with respect to DSAP, was also submitted. This Organizational
Redesign defines and procedurally establishes a pilot program with "idealized"
managerial goals and expectations that coincide with and/or contain the Air Force
mission. It thereby provides a format which not only can quantitatively test
the  claims  of  such  research  and  development  groups  as  FBR  but  also  eases  the 
transition  from  an  experimental  stage  into  a  daily  procedural  process  should 
any  future  diagnostic  screening  such  as  AFP-TMP  for  Down  Syndrome  be  deemed 
acceptable  and  feasible  by  the  "provider,"  (Armstrong  Laboratory  management 
and  technical  staff  within  the  Chemistry  Laboratory  at  Brooks  AFB),  as  well  as 
the  "user,"  (military  physicians  from  the  Army,  Navy,  and  Air  Force  bases 
worldwide,  and/or  the  various  medical  groups  affiliated  directly  or  indirectly 
with  Brooks  AFB,  [i.e.,  ACOG]).  Thus,  a  greater  commitment  to  quality  of 
service  is  achieved  when  the  testing  and  implementation  processes  are  more 
defined  and  better  understood  by  all  involved.  As  a  result,  the  following  AOE 
proposal  was  written  for  the  18  June  1992  Wilford  Hall  USAF  Medical  Center 
MSAFP  Committee.  Please  note  the  objective. 




PROPOSAL  FOR 


WILFORD HALL USAF MEDICAL CENTER


(MSAFP COMMITTEE)


ARMSTRONG LABORATORY
EPIDEMIOLOGIC RESEARCH DIVISION
CLINICAL CHEMISTRY LABORATORY
BROOKS  AFB  TX  78235-5000 


OBJECTIVE: 


Review  and  consultation  by  the  MSAFP  Committee  at  Wilford  Hall  USAF  Medical 
Center  regarding  the  implementation  of  a  multivariate  risk  screening  procedure 
in  the  assessment  of  Down  Syndrome  risk. 

INTRODUCTION: 

The  AFP  triple-marker  profile  is  a  group  of  tests,  performed  on  a  single 
maternal  serum  sample,  which  will  screen  for  open  fetal  defects,  such  as  spina 
bifida,  anencephaly,  and  abdominal  wall  defects,  and  also  for  fetal  trisomy  21 
(Down  Syndrome).  Along  with  maternal  serum  alpha-fetoprotein  (MSAFP),  two 
additional biochemical markers, unconjugated estriol (uE3) and human chorionic
gonadotropin (hCG), are measured to calculate an increase or decrease in the
patient's age-related risk for carrying a Down Syndrome-affected fetus.
(Current  ACOG  guidelines  require  low  MSAFP  and  age  alone  in  the  under  35 
population)  (1). 

DESIRABILITY: 

There  is  an  increasing  demand  for  use  of  the  triple  marker  from  the  field. 

The Epidemiologic Research Division (AOE) can't satisfy the demand for this
testing. (Consequently, private commercial labs are being used, costing the
Air Force extra monies).

The AFP Profile (triple marker profile) is just as reliable as routine MSAFP
screening for identifying open neural tube defects: the MSAFP level is still
used by itself to determine risk of open fetal defects, and the protocols and
recommendations associated with elevated MSAFP and open fetal defects are
unchanged. However, the additional markers - uE3 and hCG - significantly
enhance the sensitivity of Down Syndrome screening. (See Table 1) (Ref 2).

Compared to using age 35 alone as an indication of risk, or low MSAFP and age
alone in the under 35 population, the AFP Profile can triple the number of
cases of fetal Down Syndrome detected (from about 20% to 60%) (Ref 1).

COST ANALYSIS:

                                 Cost/Pt.(1)   Cost/Month   Cost/Annum

Brooks (2)                       $15.20        $15,200      $182,400

Commercial Lab (i.e., FBR) (3)    37.20         37,200       446,400

NOTES:

(1) Based on a projection of 1,000 patients/month

(2) AOE (maximum cost/patient - includes commercial software package costs
amortized over a 1 year period): $15.20

(3) Commercial Lab, i.e., Foundation for Blood Research (FBR). July 1, 1991
List Price $62.00; Wholesale Price: $37.20
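
The monthly and annual figures in the cost analysis follow directly from the per-patient cost and the projected 1,000 patients per month. A minimal sketch of that arithmetic, with a hypothetical helper name `projected_costs` and `Decimal` used to avoid floating-point rounding on currency:

```python
from decimal import Decimal

PATIENTS_PER_MONTH = 1_000  # projection stated in note (1)


def projected_costs(cost_per_patient, patients_per_month=PATIENTS_PER_MONTH):
    """Return (monthly cost, annual cost) for a given per-patient cost."""
    monthly = cost_per_patient * patients_per_month
    return monthly, monthly * 12


labs = {
    "Brooks (AOE)": Decimal("15.20"),
    "Commercial lab (FBR wholesale)": Decimal("37.20"),
}

for name, unit_cost in labs.items():
    monthly, annual = projected_costs(unit_cost)
    print(f"{name}: ${monthly:,.2f}/month, ${annual:,.2f}/year")

# Difference between the two annual projections.
annual_difference = projected_costs(labs["Commercial lab (FBR wholesale)"])[1] \
    - projected_costs(labs["Brooks (AOE)"])[1]
print(f"projected annual difference: ${annual_difference:,.2f}")
```

The difference figure is derived here from the table and is not stated in the proposal itself.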

RESOURCE REQUIREMENTS:


Instrumentation - hCG: Stratus II; MSAFP: Hybritech QA; uE3: Amersham
Amerlex - MSp System (Magnetic Isotopic

Software: "AFP SMS" from Maciel Assoc.

Hardware: 2 magnetic racks for the Amersham
          1 PC that is hard-wired to LIMS. (Ref 3)

Personnel Requirements (earned FTE's based on CAP Work Load Data and a
manning factor of 7,730 CAP units/FTE/mo): 2.6

CAP Points: 4.60 min/test MSAFP; 4.60 min/test hCG; 6.0 min/test uE3.

Budget: (Supply costs/month) for 1,300 tests/mo (including repeats,
calibrations, controls and dilutions)

MSAFP Kits    $2,795.00
hCG Kits       3,065.83
uE3 Kits       2,080.00
Cups              90.44
Tubes             77.82
Controls         225.00

Total         $8,334.09/mo. for 1,300 tests
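
The monthly supply total can be checked by summing the line items. The sketch below does only that arithmetic; the per-test figure it prints is a derived illustration and is not a number given in the proposal.

```python
from decimal import Decimal

TESTS_PER_MONTH = 1300  # includes repeats, calibrations, controls and dilutions

# Monthly supply costs as listed in the budget.
supplies = {
    "MSAFP kits": Decimal("2795.00"),
    "hCG kits": Decimal("3065.83"),
    "uE3 kits": Decimal("2080.00"),
    "Cups": Decimal("90.44"),
    "Tubes": Decimal("77.82"),
    "Controls": Decimal("225.00"),
}

total = sum(supplies.values(), Decimal("0"))
supply_cost_per_test = total / TESTS_PER_MONTH

print(f"total supply cost: ${total:,.2f}/month")
print(f"supply cost per test: ${supply_cost_per_test:.2f}")
```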

SUMMARY/CONCLUSION:

With triple-marker screening, maternal age is no longer the prime determinant
of risk but just one of four such variables. In fact, maternal age and AFP
are  the  two  weakest  contributors  to  the  Down  Syndrome  risk  calculation.  Thus, 
all  pregnant  patients,  regardless  of  age,  can  be  considered  candidates  for 
screening  because  it  is  just  as  valid  to  calculate  the  risk  of  an  older  woman 
downward  as  that  of  a  younger  woman  upward. 

Should  the  triple  marker  profile  for  Down  Syndrome  become  standard  procedure 
for prenatal testing, AOE will be ready to offer service quickly and
efficiently. 

TABLE 1

Combining markers

Case    AFP MoM    uE3 MoM    hCG MoM

1       0.32       0.62       1.15
2       0.60       0.61       1.44
3       0.91       0.54       1.77
4       1.22       0.43       1.93

In the four cases shown, all women were 30 years of age and,
despite disparate marker values, had an identical risk after
screening of 1 in 190. The table illustrates the point that in
triple-marker screening it is necessary to use all three assay values,
regardless of concentration.

AFP = alpha-fetoprotein, hCG = human chorionic gonadotropin, MoM = multiple of median,
uE3 = unconjugated estriol
(Results of the 18 June 1992 MSAFP Committee Meeting)

1.  AOE  will  now  do  Wilford  Hall's  MSAFP's. 

2.  There  will  be  regular  periodic  meetings  with  Wilford  Hall's  physician 
group  and  AOE  (in  particular,  AOELC),  also: 

3.  Follow-up  communications  -  with  regard  to  various  patients,  as  well 
as,  (for  the  first  time): 

4.  Alternate  meetings  to  be  held  at  AOE. 

5. Though the implementation of AFP-TMP was not supported by Wilford
Hall's MSAFP Committee (at this time), the "need-to-know" with respect to the
patient was discussed, as was:

a. A future prospective DSAP procedure, introduced by Dr Barth,
involving genotyping with DNA probes as being more favorable than AFP-TMP
(100% accuracy; no risk; this procedure to be developed within 3 to 6 years)
(13).


b.  If,  on  the  other  hand,  AFP-TMP  is  approved  by  ACOG,  Wilford  Hall's 
medical  group  (OB/GYN)  has  assured  AOE  continued  informational  ties. 

6. On the next day (after the MSAFP Committee meeting) AOE conducted a
nationwide  Air  Force  telephone  survey.  Total  number  of  Air  Force  clinical 
labs  called:  62 

Questions  Asked: 


a.  Have  your  OB/GYN  clinicians  asked  about  the  availability  of  triple 
marker  testing? 


^  laboratories  -  yes:  55  laboratories  -  no 

*Andrews AFB, MD
Griffiss AFB, NY
Hill AFB, UT
Homestead AFB, FL
*Travis AFB, CA
Williams AFB, AZ
Wright-Patterson AFB, OH
*Keesler AFB, MS
Wurtsmith AFB, MI

* - Were going to establish AFP-TMP in-house.


b.  Do  you  refer  any  requests  for  triple-marker  screening  for  Down 
Syndrome  Risk  Assessment  to  a  commercial  or  other  reference  lab? 


6 laboratories - yes:

Hill
Homestead
Keesler
X Williams (closing)
Griffiss
X Wurtsmith (closing)

56 laboratories - no

X - OB/GYN clinicians were in the process of being transferred
c. If so, approximately how many per month?

Hill                   100: hCG, uE3
Homestead              3-4: AFP-TMP
Keesler                80: AFP
Williams               5: AFP-TMP
Griffiss               6-2: AFP-TMP
Wurtsmith              14-25: hCG; 1-5: uE3

d. Cost of Panel?

Hill                   $42-uE3; $53-hCG
Homestead              $62-AFP-TMP
Keesler                $17-AFP
Williams (Closing)     ?
Griffiss               $60-AFP-TMP
Wurtsmith (Closing)    $23.50-hCG
PROBABILITY AND CUE TYPE MANIPULATIONS
IN A VISUAL ATTENTION TASK

Lawrence  R.  Gottlob 
Department  of  Psychology 
Arizona  State  University 

Abstract 

In  a  previous  study,  it  was  found  that  observers  could 
allocate  attention  to  four  locations  in  a  display  based  on 
the  probability  that  a  peripheral  (abrupt-onset)  cue  was 
valid.  The  first  experiment  presented  in  this  report  sought 
to  investigate  the  effects  of  a  central  cue,  and  the  second 
experiment  compared  central  to  peripheral  cues. 

Observers  in  both  experiments  were  run  in  visual 
attention  tasks.  The  target  was  a  T-like  character  in 
different  orientations,  that  was  presented  in  one  of  four 
different  locations  (subtending  a  total  of  12  degrees  visual 
angle)  on  a  computer  screen.  The  target  was  preceded  by  a 
central  (Experiments  1  and  2)  or  a  peripheral  (Experiment  2) 
cue,  and  followed  by  a  mask.  The  probability  that  the  cue 
was  valid  was  varied  over  blocks  of  trials.  The  observers' 
task  was  to  indicate  the  direction  that  the  "T"  was  pointing. 
Proportion  correct  scores  indicated  that  observers  allocated 
more  attention  to  more  probable  locations  (across  valid 
trials  only)  with  a  peripheral  cue,  but  that  there  was  no 
probability  effect  with  a  central  cue. 
PROBABILITY AND CUE TYPE MANIPULATIONS
IN  A  VISUAL  ATTENTION  TASK 

Lawrence  R.  Gottlob 

INTRODUCTION 

In the area of visual attention, the nature of responses
to both abrupt-onset peripheral cues and central cues has
been studied. An abrupt-onset flash on a display has been
found to initiate an automatic or transient response, while
central cues initiate a purposive and sustained response
(Jonides, 1981; Tepin & Dark, 1991; Nakayama & Mackeben,
1989; Posner & Cohen, 1984; Cheal & Lyon, 1991).

In  a  previous  study,  Gottlob  (1992)  used  a  peripheral 
cue,  and  found  that  observers  were  able  to  allocate  different 
mixtures  of  a  fixed  quantity  of  attention  over  four  locations 
of  a  display.  The  allocation  was  affected  by  cue  validity 
expressed  as  the  proportion  of  correctly  cued  trials.  By 
allocating  according  to  cue  probability,  observers  were 
maximizing  their  chances  for  correct  target  discrimination. 
Even  though  the  task  used  a  peripheral  cue,  the  study  showed 
that  observers  did  have  control  over  a  putatively  automatic 
response.

This  control  was  seen  to  have  come  from  a  top-down 
source.  When  cue  validity  is  manipulated  between  blocks  of 
trials,  individual  trials  are  indistinguishable  between 
conditions;  both  valid  and  invalid  trials  are  the  same  across 
conditions. What varies is the mixture of trials across
conditions. If a performance difference is seen across
probability conditions, that difference must necessarily be
due to top-down influences, since the bottom-up information is
identical across conditions.

The  task  used  in  Gottlob  (1992)  and  in  the  present 
report  is  a  location  cuing  task  similar  to  that  used  by  Cheal 
and  Lyon  (1989,  1991),  and  also  Cheal,  Lyon  and  Hubbard 
(1991).  The  location-cuing  paradigm  uses  a  cue  to  direct 
attention  to  one  of  four  locations  equidistant  from  a 
fixation  point,  followed  by  a  target  stimulus  and  a  mask, 
whereupon  the  observer  attempts  to  identify  the  target  at  the 
cued location. (See Figure 1 for a schematic order of
events.)

The dependent variable of accuracy is measured at
different cue-target stimulus-onset asynchronies (SOA);
for targets such as the "T" pictured in Figure 1, accuracy
has been found to increase as a function of SOA until it
begins to level off (approach asymptote) at 100 to 150 msec
(Cheal and Lyon, 1989). The SOA by accuracy curve enables
one to follow the timecourse of the build-up of attention at
the cued location. Experiments have shown that, in order to
discriminate certain types of targets, attention takes about
100 msec to "concentrate on" a cued location (Cheal & Lyon,
1989).
The increase in accuracy as a function of SOA was
obtained in trials where the cue was valid, and attention had
time to be oriented to the cued location. In addition, Cheal
et al. (1991) included an invalid cue condition in which the
target appeared at a noncued location. They
found lower accuracy for invalid trials than for valid
trials, and the accuracy by SOA function for invalid trials
was flat, unlike the rising SOA function for valid trials.
Thus the valid and invalid accuracy curves had different shapes as a
function of SOA.

The  performance  difference  between  valid  and  invalid 
trials  is  due  to  the  orienting  effect  of  the  cue.  The 
addition  of  the  factor  of  cue  probability  allows  us  to 
isolate top-down processes to determine whether top-down
control  can  be  exerted  over  attention  in  response  to  a 
peripheral  cue.  As  mentioned  above,  with  manipulation  of  cue 
validity  probability,  there  is  no  difference  in  bottom-up 
information,  so  any  changes  in  performance  must  be  due  to 
top-down  influences. 

The main finding of Gottlob (1992) is shown in
Figure 2, which plots the growth in accuracy
(proportion correct) as a function of cue-target SOA, across
valid and invalid trials, and 75% and 50% cue probabilities.
As was predicted, the 75% valid curve lies above the 50%
valid curve, and the 75% invalid curve lies below the 50%
invalid curve. The results were in accordance with optimal
search theory (Koopman, 1957), which prescribes that attention
should be allocated with regard to a probability
distribution of target location.

Experiment  1  in  this  report  was  designed  to  replicate 
the  probability  effect  with  a  central  cue  instead  of  a 
peripheral  cue.  It  was  expected  that  the  controlled  nature 
of  the  central  cue  response  would  magnify  the  probability 
effect. 

METHOD

Observers 

Two  male  and  one  female  observer  between  the  ages  of  19 
and  22,  with  normal  vision,  were  paid  $7.00  per  hour  to 
participate.  None  were  informed  of  the  purpose  of  the  study. 
Each  observer  practiced  for  a  number  of  sessions  before 
beginning  the  experimental  trials.  One  observer  was 
experienced  in  this  type  of  experiment;  two  were  not.  Each 
was  run  in  24  one-hour  sessions,  for  a  total  of  30,720  trials 
each. 

Apparatus

Stimuli were displayed on an IBM-XT with an EGA color
monitor. An adjustable head and chin rest fixed the
eye-to-screen distance at approximately 37 cm. Eye movement was
monitored with a video camera to ensure that the observers
fixated their gaze at a single location, and that observers
maintained the proper head position. Responses were recorded
on the numeric keypad of a standard IBM keyboard.

Stimuli 

Stimuli were presented as white pixels on a dark gray
background, with a total luminance of 80 cd/m^2. Targets
consisted of "T"s which could appear in one of four locations,
each six degrees from a central fixation point, and in
one of four orientations (pointing right or left sideways, up
or down).
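
As a rough illustration of the display geometry, the six-degree eccentricity can be converted to a physical distance on the screen using the approximate 37 cm viewing distance given under Apparatus. This is a sketch only: it assumes a flat screen viewed head-on with fixation at the point nearest the eye, and the function name is hypothetical.

```python
import math


def eccentricity_cm(angle_deg, viewing_distance_cm):
    """On-screen distance from fixation for a given visual angle.

    Assumes a flat screen perpendicular to the line of sight at fixation.
    """
    return viewing_distance_cm * math.tan(math.radians(angle_deg))


VIEWING_DISTANCE_CM = 37.0      # approximate value from the report
TARGET_ECCENTRICITY_DEG = 6.0   # target distance from fixation

offset = eccentricity_cm(TARGET_ECCENTRICITY_DEG, VIEWING_DISTANCE_CM)
print(f"Targets lie about {offset:.1f} cm from fixation")
```

Under these assumptions each target location falls a little under 4 cm from the fixation point.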

Procedure 

Observers were seated in front of the screen, with their
head  position  fixed  by  the  chin  support.  Screen  refreshes 
were  made  every  16.7  msec,  so  that  the  duration  in  msec  of 
any  stimulus,  target  etc.  was  a  multiple  of  16.7. 

The order of events in each condition was identical (see
Fig. 1). A fixation bar appeared for 668 msec, followed by
a 16.7 msec central arrow cue indicating one of four
locations. The cue was followed by a target at one location,
and circles at the other three locations, after a variable
stimulus-onset asynchrony (SOA). SOAs used were (rounded to
the nearest msec) 0, 33, 67, 100, 133, 167, 200, 233, 267,
and 300 msec. Following the presentation of the target, a
mask was presented which was an outline of all possible
targets at all four locations. The observer's task was to
indicate, by pressing the corresponding arrow on the numeric
keypad, which direction the target was pointing. The
observer was instructed that accuracy, and not speed, was
required.
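
Because every duration is tied to the screen refresh, the SOA values listed above can be generated from whole numbers of frames. The sketch below assumes that the 16.7 msec refresh corresponds to a 60 Hz display and that the SOAs step in increments of two frames; both are inferences from the listed values rather than statements in the report.

```python
REFRESH_HZ = 60  # assumption: a 16.7 msec refresh period implies 60 Hz


def frames_to_msec(n_frames, refresh_hz=REFRESH_HZ):
    """Duration of n_frames screen refreshes, rounded to the nearest msec."""
    return round(n_frames * 1000 / refresh_hz)


# Even frame counts from 0 to 18 give the ten SOAs used in Experiment 1.
soa_frames = list(range(0, 19, 2))
soas_msec = [frames_to_msec(n) for n in soa_frames]
print(soas_msec)
```

The same frame arithmetic with the rounded 16.7 msec period accounts for the 668 msec fixation interval (40 x 16.7).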

There  were  four  blocked  conditions  which  consisted  of 
different  percentages  of  cue  validity;  observers  were 
informed  of  what  condition  was  being  run.  In  the  100% 
condition,  the  cue  always  indicated  the  correct  position  of 
the  target.  In  the  75%  condition,  the  cue  indicated  the 
correct  target  location  on  75%  of  the  trials,  but  on  25%  of 
the  trials  the  target  would  appear  at  any  one  of  the  noncued 
locations  with  equal  (8.3%)  probability.  The  50%  condition 
was  similar  to  the  75%  condition  except  for  50%  correct 
(valid)  cuing  and  50%  incorrect  (invalid)  cues,  split  16.7% 
to  each  of  the  noncued  locations.  The  25%  condition  used  a 
cue  which  was  valid  25%  of  the  time  and  invalid  on  75%  of  the 
trials,  split  25%  to  each  of  the  noncued  locations;  this  last 
type  of  cue  gave  essentially  no  information  since  the  chance 
probability  of  a  target  appearing  at  any  one  location  was  25%. 
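
The per-location probabilities in each blocked condition follow from dividing the invalid share equally among the three noncued locations. The sketch below reproduces those figures and shows one way a trial's target location could be drawn; it is an illustration, not the program used in the experiment, and the function names are hypothetical.

```python
import random

N_LOCATIONS = 4


def location_probabilities(cue_validity, cued, n_locations=N_LOCATIONS):
    """Probability of the target appearing at each location.

    The cued location gets cue_validity; the remainder is split equally
    among the noncued locations.
    """
    per_noncued = (1 - cue_validity) / (n_locations - 1)
    return [cue_validity if loc == cued else per_noncued
            for loc in range(n_locations)]


def draw_target_location(cue_validity, cued, rng):
    """Sample a target location for one trial, given the cued location."""
    weights = location_probabilities(cue_validity, cued)
    return rng.choices(range(N_LOCATIONS), weights=weights, k=1)[0]


for validity in (1.00, 0.75, 0.50, 0.25):
    probs = location_probabilities(validity, cued=0)
    print(f"{validity:.0%} valid: each noncued location {probs[1]:.1%}")

rng = random.Random(0)  # fixed seed so the example is repeatable
sample = [draw_target_location(0.75, cued=2, rng=rng) for _ in range(8)]
```

In the 25% condition every location, cued or not, has the same probability, which is why that cue carries essentially no information.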

Each  observer  was  first  instructed  in  the  task  and  run 
for  a  number  of  sessions  at  the  100%  level  for  training 
purposes.  Target  durations  were  individually  determined  for 
each  observer  on  the  basis  of  performance  in  the  training 
sessions: 33 msec (CT), 33 msec (CW), or 17 msec (MU).

Each  observer  was  exposed  to  24  sessions  of  1280  trials:  six 
sessions  each  at  the  four  probabilities,  with  an  equal  number 
of trials at each SOA, and counterbalanced across cue and
stimulus location, direction of target, and target duration.
Total number of trials per observer per condition per SOA
ranged from a maximum of 768 trials to a minimum of 187
trials per point.
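
The trial counts can be checked against the design. The sketch below computes the nominal number of trials per data point from the session structure, assuming trials were split exactly according to cue validity; the names are hypothetical.

```python
SESSIONS = 24
TRIALS_PER_SESSION = 1280
N_CONDITIONS = 4   # 100%, 75%, 50% and 25% cue validity
N_SOAS = 10

total_trials = SESSIONS * TRIALS_PER_SESSION
trials_per_condition = total_trials // N_CONDITIONS
trials_per_condition_per_soa = trials_per_condition // N_SOAS


def nominal_trials_per_point(cue_validity, valid=True):
    """Nominal trials behind one point on a valid or invalid curve."""
    share = cue_validity if valid else 1 - cue_validity
    return trials_per_condition_per_soa * share


print(total_trials)                                   # total per observer
print(nominal_trials_per_point(1.00))                 # largest cell
print(nominal_trials_per_point(0.25))                 # 25% valid
print(nominal_trials_per_point(0.75, valid=False))    # 75% invalid
```

The nominal maximum matches the 768 trials reported above. The nominal minimum under this assumption is 192, slightly above the reported 187; the report does not say what accounts for the difference.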

Data  were  collected  in  the  form  of  accuracy  scores  (by 
SOA)  for  valid  and  invalid  conditions,  for  both  individual 
observers  and  combined  observers.  An  analysis  of  accuracy  by 
SOA  curves  by  condition  was  planned. 

RESULTS AND DISCUSSION

Figure 3 illustrates accuracy by SOA for all four
conditions. As can be seen in the figure, the curves for
valid trials all lie above the invalid curves, indicating
that performance is better at cued locations than at noncued
locations with a central cue. This finding replicates Cheal
et al. (1991) and Gottlob (1992). The manipulation of
interest, probability, appears to have no effect, as opposed
to the effect seen with the peripheral cue (Figure 2). To
directly contrast the present findings with those found with
the peripheral cue, the 75% and 50% curves are presented
alone in Figure 4, which shows that the 75% valid curve does
not lie above the 50% valid curve and that the 75% invalid
curve does not lie below the 50% invalid curve.

Data were analyzed for all conditions, using the
Hierarchical Log-linear analysis in BMDP4F, which is a
multidimensional nonparametric variant of the chi-square test, for
dichotomous data. A summary table of partial chi-squares,
along with significance values, is presented in Table 1.

There were significant effects of SOA and validity. There
was a significant effect across observers, which was to be
expected since target durations and, presumably, individual
ability levels differed across observers.

There was a significant validity x SOA interaction for
combined and all individual observers, indicating that the
invalid curves had different shapes than the valid curves.
The valid curves rose as a function of SOA; the invalid
curves were flat across SOAs. There was also a significant
probability x validity interaction in two of the four tests;
however, the interaction was not indicative of a probability
effect upon visual inspection of the curves. There was no
probability x validity x SOA interaction, indicating no
significant non-parallelism of curves.

There  was  a  significant  validity  x  SOA  x  observer 
interaction,  due  to  individual  differences  in  shape  and  value 
of  curves. 

The  results  indicate  that  performance  at  cued  locations 
improves  as  a  function  of  SOA,  and  noncued  performance  does 
not  improve  with  SOA.  In  contrast  to  the  probability  effect 
seen  with  a  peripheral  cue,  there  is  no  probability  effect 
seen  with  the  central  cue,  indicating  that  observers  cannot 
allocate attention in accordance with cue probability.

The inability to allocate may be due to several factors.
The first is that processing of a central cue may impose a
capacity limitation on the resources used for allocating
attention. As discussed above, there is evidence that the
response to a peripheral cue is at least partly automatic, while
the central cue response is voluntary. A central cue must be
interpreted and attention must be moved, which may leave no
resources for anything but movement to the cued location.

Another possible impediment to allocation may be a
"mechanical" problem of moving attention in several
directions simultaneously. In an "automatic" task like
peripheral cuing, it may be possible to spread attention over
an entire display at fixation (facilitating detection of the
cue) and then allow attention to concentrate on the cued
location while exerting some control over the amount
remaining at the noncued locations. In contrast, with a
central cue, if central fixation is required to interpret the
cue, then attention may be concentrated at the center. An
optimal allocation of attention for any given probability cue
would require the simultaneous movement to four locations:
the single cued and the three noncued. A purposive movement
of attention to four separate locations might be too
difficult to perform.

A third possibility is that the finding of no
probability effect is a Type II error; that is, the experiment
did not pick up the (real) effect. It is hard to compare
results from peripheral and central cue designs since we
did not use a true within-subject design. Experiment 2 in
this report was designed to remedy this problem by
manipulating both probability and cue type (central vs.
peripheral) in a within-subject design.

EXPERIMENT  2 

Experiment  2  was  designed  to  directly  investigate  the 
hypothesis  that  probability  allocation  is  possible  with  a 
peripheral  cue  but  not  with  a  central  cue.  To  that  end,  we 
used  two  probabilities  (75%  and  50%)  and  two  types  of  cue 
(central and peripheral).

METHOD 

Observers 

Four  observers,  three  male  and  one  female,  between  the 
ages  of  20  and  33,  were  used  in  this  experiment.  Two  were 
experienced  in  this  type  of  experiment;  one  observer  (CT)  had 
served  in  Experiment  1. 

Apparatus  and  Stimuli 

The  apparatus  and  stimuli  were  the  same  as  used  in 
Experiment 1.

Procedure 

The  trials  were  similar  to  those  in  Experiment  1,  with 
the following changes: (1) There were 7 SOAs: 0, 50, 100,
150, 200, 250, and 300 msec. (2) Cue type was manipulated
across sessions. A central cue consisted of a "V" presented
at fixation that cued one of four target locations. A
peripheral cue was the same "V" presented 1 degree to the
outside of a single target location. (3) Only two
probabilities were used for cue validity, 75% and 50%, since
those did not represent any boundary conditions as 100% and
25% did. (4) Target durations ranged from 33 to 67 msec.
Number of trials per observer, per point, ranged from 185 to
495.

RESULTS AND DISCUSSION

The data for one observer, MP, were not included in the
analysis because he was unable to respond to a central cue.
Therefore, the analysis includes data only from the
other three observers.

The accuracy by SOA curves are presented separately for
the central and peripheral conditions in Figure 5. The
peripheral cue data show a possible probability effect for
valid but not invalid trials, while the central cue data
show a possibly very small effect for valid trials but not
for invalid trials.

Table 2 shows the results of the Hierarchical Log-linear
analysis for all observers combined and for individuals.
Both central and peripheral cues show effects of validity, SOA,
and observer. The interaction of most interest is the one of
probability by validity: that, combined with an absence of a
probability by SOA interaction (indicating parallel curves)
and visual inspection, can indicate further tests for
valid and invalid curves separately. The P x V interaction
was not significant for the central cue (indicating no effect
of probability) but it was significant for the peripheral
cue, which suggests further tests for the peripheral cue.

Table  3  shows  the  results  for  valid  and  invalid  trials 
only,  for  the  peripheral  cue.  There  was  no  probability 
effect for invalid trials, which is not surprising given
Figure 5, which shows the two lines crossing. For valid
trials,  there  was  an  effect  of  probability  for  all  observers 
combined  and  for  two  of  the  three  individuals. 

Therefore, the findings for the second experiment are
equivocal. The experiment replicates the lack of probability effect
for the central cue, but it only partly replicates the
effects found in last summer's experiment with the peripheral
cue. The first peripheral cue experiment found probability
effects for both valid and invalid trials, while the present
experiment found the effect for valid but not invalid trials.

CONCLUSION

Both experiments 1 and 2 found that observers could not
perform a probability allocation of attention in response to
a central cue. Experiment 2 partly replicated last summer's
experiment in that it found a probability effect for valid
trials (but not for invalid trials).

At this point it seems appropriate to investigate
further the peripheral cue effect. The effect was
significant at the 0.01 level for all observers combined;
however, the magnitude of the effect was small: the mean
proportion correct for the 75% valid trials was 0.861, and the
mean proportion correct for the 50% valid was 0.832. This
yielded a small but consistent difference of 0.029, which
represents only 3.4% of the average accuracy.
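
The size of the effect can be reproduced from the two reported means. The sketch below assumes that "average accuracy" means the simple mean of the two proportions, which is the reading that recovers the 3.4% figure; the variable names are hypothetical.

```python
P_CORRECT_75_VALID = 0.861
P_CORRECT_50_VALID = 0.832

difference = P_CORRECT_75_VALID - P_CORRECT_50_VALID

# Assumption: "average accuracy" is the mean of the two reported proportions.
average_accuracy = (P_CORRECT_75_VALID + P_CORRECT_50_VALID) / 2
relative_effect = difference / average_accuracy

print(f"Difference: {difference:.3f}")
print(f"Relative to average accuracy: {relative_effect:.1%}")
```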

One  way  to  boost  the  effect  (if  it  is  real)  is  to 
present  two  probabilities  which  are  more  discriminable  by  the 
observers.  The  difference  between  75%  and  50%  is  only  one 
third  of  the  higher  value.  In  the  next  experiment,  I  will 
try  to  produce  a  larger  effect  by  using  the  two  probability 
levels  of  66%  and  33%,  where  one  level  is  twice  the  other. 
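
The contrast between the two pairs of probability levels can be stated numerically. This short sketch compares the current and proposed pairs by their gap relative to the higher value and by their ratio; the function names are hypothetical, and the measures are simply the two comparisons made in the paragraph above.

```python
def relative_gap(high, low):
    """Difference between two levels as a fraction of the higher one."""
    return (high - low) / high


def level_ratio(high, low):
    """How many times larger the higher level is than the lower one."""
    return high / low


for high, low in ((75, 50), (66, 33)):
    print(f"{high}% vs {low}%: gap = {relative_gap(high, low):.2f} "
          f"of the higher value, ratio = {level_ratio(high, low):.1f}")
```

By either measure the proposed 66% and 33% levels are further apart than 75% and 50%, which is the basis for expecting them to be more discriminable.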
Figure 1. ... target presentation after a variable cue-target onset
asynchrony (SOA). On valid trials, the target appears at the
cued location. On invalid trials, the target appears at one
of the three noncued locations.

Figure 2. Accuracy (proportion correct) by SOA for Gottlob,
1992. The two top lines represent performance for valid
trials, and the bottom lines for invalid trials. Two
probability conditions are represented: 75% and 50%.
Figure 5. Accuracy by SOA curves for Experiment 2. The top
graph is for central cues, and the bottom graph is for
peripheral cues.
Abstract

The effect of orally administered exogenous melatonin on oral
temperature and subjective mood states was tested in a placebo-controlled,
double blind design. Subjects were tested from 0800 - 1700. At 0915
subjects ingested either 0 mg, 10 mg or 100 mg of melatonin. Serum
melatonin for both of the treatment conditions rose rapidly and remained
well above physiological levels for the entire testing session. Exogenous
melatonin  administration  yielded  significant  dose-dependent  decreases  in 
oral  temperature.  Exogenous  melatonin  treatment  also  significantly 
increased  subjective  ratings  of  fatigue. 
The Effects of Two Doses of Exogenous Melatonin on
Temperature  and  Subjective  Fatigue. 


Rod  J  Hughes 


Introduction 

The  pineal  hormone  melatonin  (MT)  has  been  implicated  as  a  possible 
"hypnotic"  or  sleep  inducing  agent.  The  present  investigation  used  a 
placebo-controlled,  double  blind  design  to  test  some  sedating  qualities  of  two 
doses  of  orally  administered  exogenous  melatonin.  Compared  to  placebo, 
exogenous  melatonin  showed  dose-dependent  decreases  in  oral  temperature. 
Exogenous  melatonin  also  significantly  increased  subjective  fatigue  in  a 
dose-dependent  manner. 

Endogenous  melatonin  is  produced  primarily  in  the  pineal  gland  via 
metabolism  of  tryptophan  (1).  The  vast  majority  of  melatonin  is  produced  and 
released  into  all  parts  of  the  brain  and  all  parts  of  the  body  primarily  at  night 
(2). There is considerable evidence that melatonin is the "messenger of
darkness", mediating the effects of photoperiod throughout the body (2; see
also  3).  In  fact,  endogenous  melatonin  is  likely  responsible  for  much  if  not 
all  circadian  rhythmicity  in  mammalian  physiology. 

Exogenous  melatonin  has  been  shown  to  have  two  types  of  effects  on 
human physiology. First, exogenous melatonin has chronic circadian
phase-setting effects. For instance, melatonin has been used to facilitate
adjustment  to  jet-lag  (e.g.  4,5).  Second,  exogenous  melatonin  has  more  acute 
sedating  qualities  (e.g.  6).  The  present  investigation  will  focus  on  the 
immediate  effects  of  exogenous  melatonin  administration. 

Evidence  supporting  the  contention  that  melatonin  has  sedating  qualities 
comes from several sources. Lieberman et al. showed that a large dose of
melatonin (240 mg over two hours) produced significant tranquilizing effects.
After ingestion of MT, their subjects reported increased subjective fatigue.
They also showed behavioral decrements in the form of longer reaction times
to psychomotor tasks (6). Others have reported similar subjective effects for
medium (50 mg) doses of exogenous melatonin (7) and for smaller (2 mg)
doses of exogenous melatonin (8).

The  few  investigations  that  have  directly  tested  the  effects  of  exogenous 
melatonin  on  sleep  support  the  potential  use  of  MT  as  a  sleep  inducing  agent. 
For instance, Waldhauser et al. reported that large doses of melatonin can
significantly  improve  sleep  (9).  Studies  using  lower  doses  of  MT  have  also 
reported  promising  findings  (10,11,12). 

The  primary  advantages  of  using  MT  as  a  sleep  aid  are  that  (a)  MT  is  a 
natural substance that is metabolized by the body every night and relatedly
that  (b)  MT  does  not  appear  to  show  the  side  effects  associated  with 
benzodiazepines  (the  most  commonly  used  type  of  hypnotic  drug).  For 
instance,  Temazepam  has  been  recommended  as  a  sedative  for  promoting  crew 
rest  (13,14).  Although  Temazepam  has  proven  to  be  effective  in  promoting 
daytime  sleep  (13,14)  it,  like  other  benzodiazepines,  has  the  potential  to 
produce  anterograde  amnesia  (e.g.  13).  Further,  because  of  its  intermediate 
half-life  (metabolic  rate)  accumulation  and  carry-over  effects  are  other 
potential  problems  with  this  and  other  benzodiazepines  (15). 

It  is  hoped  that  since  melatonin  is  a  natural  substance  it  will  not  show 
the  negative  side  effects  associated  with  typical  hypnotic  drugs.  Therefore, 
for  promoting  crew  rest  exogenous  melatonin  may  prove  to  be  a  safe  and 
effective  alternative  to  the  benzodiazepines.  Before  implementation, 
however,  more  systematic  research  on  the  specifics  of  melatonin 
administration  needs  to  be  done.  For  instance,  in  order  to  choose  an  effective 
dose  for  sleep  research  some  of  the  pharmacokinetic  issues  of  melatonin 
administration  need  to  be  documented.  These  issues  are  rate  of  absorption  or 
the  speed  at  which  exogenous  melatonin  is  brought  into  the  system  and 
elimination  half  life  or  how  quickly  exogenous  melatonin  is  metabolized. 
Therefore,  aside  from  testing  the  effects  of  MT  on  temperature  and 
subjective fatigue, the present investigation was designed to document some
of the pharmacokinetics of two morning doses of exogenous melatonin.


Method

Subjects

Six  male  volunteers  between  the  ages  of  23  and  42  participated  in  three 
one  day  sessions  spaced  a  week  apart.  Three  of  the  subjects  were  paid  for 
their  participation.  The  remaining  three  subjects  received  time  off  from 
work  as  compensation.  After  the  experiment  it  was  discovered  that  one  of 
the  subjects  suffered  from  a  sleep  disorder.  Consequently,  his  data  were  not 
included  in  the  final  analyses. 

Materials

Melatonin  and  Placebo 

The  pill  conditions  were  0  mg,  10  mg  and  100  mg  of  synthetic  melatonin 
(Sigma  Chemical  Co,  Saint  Louis,  Missouri)  mixed,  in  500  mg  gelatin  capsules, 
with  Konsyl  (6.0  grams  of  Psyllium  Fiber  per  teaspoon,  Konsyl 
Pharmaceuticals  Fort  Worth,  Texas). 

Subjective  Fatigue  Scale 

This  is  a  seven  point  self-report  fatigue  scale.  The  scale  ranges  from  1 
(fully  alert;  wide  awake;  extremely  peppy)  to  7  (completely  exhausted;  unable 
to  function  effectively;  ready  to  drop).  This  scale  and  others  like  it  (e.g.  the 
Stanford  Sleepiness  Scale)  have  been  shown  to  be  sensitive  to  fatigue. 

Profile  of  Mood  States  (POMS) 

The POMS is a computer presented five point self-report scale in which
subjects rate how well a given adjective describes their present feelings.
The 65 adjectives aggregate into 6 emotional factors: Anger, Confusion,
Depression, Fatigue, Tension and Vigor. The POMS has also been shown to be a
sensitive measure of fatigue as well as a sensitive measure of
emotional changes associated with some drug administration (6).

Oral  Temperature 

Subjects  took  their  oral  temperature  every  hour  using  a  Becton  Dickinson 
(B-D)  Digital  Thermometer. 

Melatonin  Collection 

Systemic levels of melatonin were measured from samples of saliva and
serum. Every hour subjects provided approximately 4 cc of saliva, which was
chilled in ice immediately. Within 10 minutes the saliva samples were placed
in a Clay-Adams Sero-Fuge II centrifuge and centrifuged at 3491 rpm for
10-15 minutes. About 1 ml of sputum was then extracted and placed in
microfuge tubes. These samples were then stored at -70 degrees Fahrenheit
until they were shipped to an outside laboratory for assaying.

Blood (10 cc) was extracted every two hours via an indwelling IV catheter
with a heparin lock. These samples too were chilled until being centrifuged.
The serum was then separated into 1 ml aliquots and placed in microfuge
tubes. All other handling of the serum samples was the same as for the
saliva samples.

Melatonin  Assay 

Serum and saliva samples were taken to Dr. Reiter's laboratory at UTSA
Health and Science Center for radioimmunoassay (16).

Procedure 

Subjects arrived at the laboratory at 0700, at which time a 20 gauge
indwelling IV catheter was inserted into one of their arms. Beginning at
0800 and ending at 1700, saliva, temperature, and subjective fatigue were
assessed hourly. Serum samples and the POMS were taken every two hours.
At 0915 subjects ingested a pill containing either 0 mg, 10 mg, or 100 mg of
melatonin. The order of pill administration was counterbalanced across
the sessions. Subjects were required to remain in the laboratory for the
duration of each testing session (0800-1700). While in the laboratory,
subjects could read, write, work on a computer, or play computer/video games.
Subjects had hourly access to juice; however, they refrained from eating for
the duration of the testing session.

Analyses 

Two-way (pill condition X time of day) repeated measures analyses of
variance (ANOVA) were performed on the data. 3 (pill condition) X 10 (time of
day) ANOVAs were run on oral temperature and subjective fatigue. 3 (pill
condition) X 5 (time of day) ANOVAs were run on each of the factors that
make up the POMS test (anger, confusion, depression, fatigue, tension, and
vigor).
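
As a reading aid for the F ratios in the Results, the degrees of freedom of a fully within-subjects two-way design follow from the number of levels of each factor and the number of subjects. The Python sketch below is illustrative only: the function name is hypothetical, the subject count passed to it is an assumption (the text does not state how many subjects entered each analysis), and it assumes complete data with no sphericity correction.

```python
def rm_anova_df(levels_a, levels_b, n_subjects):
    """Degrees of freedom for a two-way, fully within-subjects ANOVA.

    Returns a dict mapping each effect to (effect df, error df).
    Assumes complete data and no sphericity correction.
    """
    df_a = levels_a - 1
    df_b = levels_b - 1
    df_ab = df_a * df_b
    s = n_subjects - 1  # each error term is the effect-by-subject interaction
    return {
        "A": (df_a, df_a * s),
        "B": (df_b, df_b * s),
        "A x B": (df_ab, df_ab * s),
    }


# 3 pill conditions x 10 times of day; 4 subjects is a hypothetical count
print(rm_anova_df(3, 10, 4))
```

Changing the number of time points or subjects changes every error term, which is why the df differ between the hourly measures and the two-hourly POMS.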

Results

At the time of this publication only two thirds of the melatonin samples
had been assayed; however, Figure 1 depicts clear dose-dependent effects for
serum melatonin levels (saliva data show similar functions). As expected,
melatonin levels in the placebo condition remained within physiological
levels. The two treatment conditions, however, showed marked
dose-dependent levels of melatonin. After ingestion of exogenous melatonin,
serum melatonin levels quickly rose to pharmacological levels and remained
high throughout the testing session.

Oral temperature revealed a significant main effect for dose, F(2,6) =
13.22, p < .005. There was also a significant main effect for time of day,
F(8,24) = 3.82, p < .005. The dose by time of day interaction approached
statistical significance, F(16,48) = 1.75, p < .07. Because of the high
variability of the 0800 temperature data, this time point was removed from
this analysis. This did not significantly affect either the form of the data or
the statistical analysis. Figure 2 shows the temperature data represented as
difference scores from the 0900 (baseline) time point.
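
The difference-score transformation amounts to subtracting each session's baseline reading from every later reading. A minimal Python sketch follows; the function name and the temperature values are hypothetical and are not taken from the study data.

```python
def difference_scores(readings, baseline_time="0900"):
    """Subtract the baseline reading from every time point.

    `readings` maps a time-of-day label to a temperature.
    """
    baseline = readings[baseline_time]
    return {t: round(temp - baseline, 2) for t, temp in readings.items()}


# Hypothetical oral temperatures for one subject in one session
example = {"0900": 97.8, "1000": 97.6, "1100": 97.9}
print(difference_scores(example))
```

Expressing each reading relative to the subject's own baseline removes between-subject differences in resting temperature from the plotted curves.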

Analysis of the subjective fatigue ratings (see Figure 3) also revealed
significant main effects for dose, F(2,6) = 16.49, p < .005, and for time of day,
F(9,27) = 5.89, p < .005. Overall, the 100 mg condition produced the highest
mean fatigue ratings (0 mg = 1.55, 10 mg = 1.83, 100 mg = 2.27). The dose by
time of day interaction was also significant, F(18,54) = 2.88, p < .05.

The POMS fatigue ratings (see Figure 4) also revealed a significant main
effect for dose, F(2,4) = 6.52, p .05. Again the 100 mg condition produced the
highest mean ratings of fatigue (0 mg = 35.07, 10 mg = 36.87, 100 mg =
44.26). The dose by time of day interaction was again significant, F(8,16) =
4.83, p < .05.

POMS tension ratings yielded a significant main effect for time of day,
F(4,8) = 7.22, p < .05. For these data as well, the dose by time of day
interaction was significant, F(8,16) = 3.12, p < .05. Figure 5 depicts the
changes in POMS tension ratings across the day.

None of the remaining POMS factors yielded significant results (see
Figures 6, 7, 8 & 9). However, although they were not statistically significant,
the data for subject ratings of POMS vigor (Figure 6) and POMS confusion
(Figure 7) are consistent with the finding that subjects were more tired in
the 100 mg condition.

Discussion 

The  results  of  this  investigation  add  to  the  current  body  of  evidence  that 
exogenous  melatonin  has  significant  sedating  qualities.  Melatonin  produced 
significant  increases  in  subjective  fatigue.  This  investigation  also  adds  to 
the  body  of  evidence  that  MT  plays  a  significant  role  in  thermoregulation  (17). 
Czeisler has shown that going to sleep while body temperature is decreasing
leads to shorter latency to sleep and longer sleep duration (18). Therefore,
since sleep quality is known to be dependent on body temperature, some of the
sleep promoting effects of exogenous MT administration may be secondary to
its effects on body temperature.

Future research should extend the window of melatonin sampling in order
to better assess some of the pharmacokinetics of exogenous melatonin
administration. Ongoing collaborative investigations are focusing on this
issue as well as studying some of the behavioral/performance decrements
associated with daytime melatonin administration. In addition, ongoing
research is directly assessing the efficacy of melatonin on polygraphically
recorded sleep.

The present investigation was part of an ongoing evaluation of the
hypnotic efficacy of exogenous melatonin. The goal of this research is to
demonstrate that melatonin can be used as an effective and safe alternative
to benzodiazepines for promoting crew rest. In addition, future research will
focus on the combined strategy of daytime melatonin administration and
nighttime bright light administration in order to (a) promote daytime sleep,
(b) increase nocturnal performance, and (c) rapidly phase shift endogenous
circadian rhythms.

Figure 2 Dose-response changes in oral temperature by time of day.

Figure 3 Dose-response changes of subjective fatigue by time of day.

Figure 4 Dose-response changes in POMS fatigue by time of day.

Figure 5 Dose-response changes in POMS tension by time of day.

Figure 6 Dose-response changes in POMS vigor by time of day.

Figure 7 Dose-response changes in POMS confusion by time of day.

Figure 8 Dose-response changes in POMS depression by time of day.

Figure 9 Dose-response changes in POMS anger by time of day.

ASSISTING AIR FORCE INSTRUCTIONAL DESIGNERS


Tricia  Jones 
Graduate  Student 
Educational  Studies 
University  of  Michigan 

Abstract 

The  Instructional  Design  Branch  (HRTC)  of  the  Training  Research  Division  at  Brooks  AFB  is 
committed  to  providing  Air  Force  instructional  designers  with  tools  that  support  a  high  level  of  design 
efficiency  and  quality.  One  of  the  primary  thrusts  is  a  project  known  as  AIDA,  the  Advanced 
Instructional  Design  Advisor.  Two  short-term  projects  represent  portions  of  the  final  project.  XAIDA  is 
the  Experimental  Instructional  Design  Advisor;  it  represents  a  limited  prototype  of  the  lesson  content 
specification  process.  GAIDA  stands  for  the  Guided  Approach  to  Instructional  Design  Advising.  This  is 
a  case-based  system  which  provides  guidance  to  novice  designers  about  lesson  design  and  delivery, 
based on Gagné's Nine Events of Instruction (Gagné, Briggs, & Wager, 1992).

During  the  course  of  my  summer  research  tenure,  I  was  able  to  apply  a  broad  range  of  my  own 
expertise.  I  was  able  to  combine  knowledge  of  human  factors  and  interface  issues  with  instructional 
design  expertise  in  order  to  evaluate  XAIDA.  Furthermore,  I  again  used  my  instructional  design 
expertise  as  well  as  my  skills  in  software  development  to  refine  the  design  of  GAIDA,  implement  a  case 
for  it,  and  write  instructional  design  commentary  for  other  cases. 

XAIDA  and  GAIDA  represent  different  approaches  to  advising  instructional  designers.  XAIDA 
attempts  to  shield  Air  Force  designers  from  the  need  to  know  principles  of  instructional  design.  GAIDA, 
on  the  other  hand,  presents  the  designers  with  a  particular  model  and  expects  them  to  understand  it 
and  incorporate  it  into  their  own  lessons.  Furthermore,  XAIDA  represents  a  particular  theory  of 
learning  and  knowledge  structure;  GAIDA,  on  the  other  hand,  represents  a  model  of  lesson  delivery. 
The  design  of  AIDA  combines  a  variety  of  tools  in  order  to  provide  support  for  novice  instructional 
designers. The complementary strengths and weaknesses of XAIDA and GAIDA reinforce the need for
such a structure.

Assisting Air Force Instructional Designers


Tricia Jones


1.  Introduction 

This summer, I was affiliated with the Instructional Design Branch (HRTC) of the Training Research
Division at Brooks AFB. The Instructional Design Branch is committed to providing Air Force
instructional designers with tools that support a high level of design efficiency and quality. One of the
primary thrusts is a project known as AIDA, the Advanced Instructional Design Advisor. During my
assignment at HRTC, I participated in the development of tools for Air Force personnel involved in
instructional design. I contributed to the long-range goals of AIDA by evaluating XAIDA and
participating in the development of GAIDA.

1.1  Overview  of  HRTC  Work:  AIDA 

The  Air  Force  faces  a  particular  problem  in  regard  to  designed  instruction.  Instructors  are  chosen  from 
the  ranks  of  experts  in  the  specialities  on  the  basis  of  their  knowledge  of  the  domain.  Many  of  these 
personnel do not have any particular expertise in designing instruction or delivering lessons. Although
they  attend  an  instructor's  course  before  their  tour  of  duty  as  an  instructor,  the  course  does  not  cover  in 
depth  the  type  of  information  that  is  necessary  to  become  an  expert  instructional  designer.  Trainers  are 
presented  with  a  formal  model  of  instructional  system  design  (ISD)  (Andrews  &  Goodson,  1980),  but 
many  find  this  too  complex  to  use  effectively. 

To  help  overcome  this  problem,  a  long-term  effort  is  underway  within  HRTC  to  develop  tools  that 
will  assist  novices  in  the  instructional  design  process.  AIDA,  the  Advanced  Instructional  Design 
Advisor,  is  scheduled  to  be  completed  in  FY97.  "AIDA  would  incorporate  prescriptive  advice  about 
course  authoring  based  on  established  theories  of  knowledge,  learning,  and  instruction.  It  would  contain 
a  variety  of  tools.  Some  would  automate  established  processes.  Others  would  assist  or  advise  authors 
about  designing  effective  instructional  materials.  The  tools  might  be  used  as  stand-alone,  special 
purpose  tools,  or  as  integrated  tools  using  a  shared  database  of  course  and  content  information.  The 
purpose  of  the  AIDA  program  is  to  provide  reliable,  effective,  and  efficient  instructional  design 
guidelines  for  USAF  course  designers"  (Hickey,  Spector,  and  Muraida,  1991,  pp.  1-3).  AIDA  would 
incorporate  artificial  intelligence  in  the  form  of  an  expert  system  for  planning  courseware. 

One  segment  of  AIDA  will  contain  a  number  of  shells  which  pertain  to  different  learning  types  or 
learning  goals.  The  shells  would  incorporate  elements  that  expert  instructional  designers  know  to 
consider.  A  subject-matter  expert  would  interact  with  this  shell  to  specify  the  content  knowledge  of  the 
lesson. The shell would prompt this novice designer, perhaps through a structured series of questions, to
specify various parameters of the lesson, such as the type of learners, their motivation levels, prior
knowledge, target performance, etc. The shell would then format and deliver the lesson using the most
appropriate delivery methods.

AIDA  represents  the  long-term  product  in  the  goal  of  assisting  novice  instructional  designers.  In 
the  short  term,  two  pieces  have  been  under  development.  XAIDA,  which  stands  for  the  Experimental 
Instructional  Design  Advisor,  represents  a  limited  prototype  of  the  lesson  content  specification  process. 
It  incorporates  instructional  design  theories  and  methods  of  David  Merrill  (Merrill,  et  al.,  1990). 
Another interim product is GAIDA, the Guided Approach to Instructional Design Advising. This effort
was initially directed by Dr. Robert Gagné and incorporates his nine events of instruction (Gagné,
Briggs, & Wager, 1992). It is a case-based advisor focussing on lesson delivery.

1.2 Overview of My Tasks

My efforts this summer were varied. My initial task at the Laboratory was to become familiar with
the various resources that are being used to assist instructional designers. I examined the ID Advisor
(originally developed by Progressive Learning Systems under the Small Business Innovative Research
program). I learned the scripting language associated with ToolBook (Asymetrix, 1989), an authoring
platform for IBM PCs running Windows (this is the development environment for GAIDA). Once
familiar with the resources, I chose two projects to work on, projects which were do-able in the course of
a summer yet which could contribute to the mission of the branch. I evaluated XAIDA in its current form
and evaluated existing GAIDA modules. In addition, I participated in the redesign of GAIDA to
reflect a case-based framework, created the content for one GAIDA case, and specified the instructional
design guidance to accompany three other modules.

In  this  report,  I  will  focus  on  the  two  primary  tasks:  the  evaluation  of  XAIDA  and  the 
evolutionary  design  process  of  GAIDA.  I  will  describe  my  efforts,  and  explain  how  they  fit  into  the 
context  of  the  ongoing  efforts  of  the  branch. 

2.  Evaluation  of  XAIDA 

HRTC  will  be  evaluating  the  use  of  XAIDA  in  various  contexts.  It  was  suggested  to  me  that  I  evaluate 
this  software  from  the  perspective  of  an  instructional  designer,  in  order  to  provide  formative  feedback 
to  Mei  Technology,  the  contractor  for  the  software.  To  this  end,  I  decided  to  implement  the  same  lesson 
content (end-to-center band marking color code for electrical resistors) in two formats: as a lesson
generated  by  XAIDA,  and  as  a  stand-alone  lesson  for  GAIDA. 

Currently, two manuals and a fairly detailed example lesson are available for XAIDA. One
manual is provided for students running the lesson; the other is a tutorial aimed at designers. To
familiarize myself with the tool, I worked through both tutorials. In the process, I took extensive notes
(22 pages) on the usability of the program and the support it provides for designers. The difficulties I
faced and the recommendations I made as a result have been transformed into a report for Mei
Technology. In the section below, I discuss some of the theoretical perspectives which I applied to my
evaluation.

2.1  Formative  Evaluation 

The  types  of  questions  that  can  be  asked  to  examine  support  for  computer  use  include:  is  the  interface 
consistent?  can  computer  novices  negotiate  the  interface?  is  there  sufficient  support  for  "power  users?" 
does  the  software  warn  about  or  provide  opportunities  to  recover  from  drastic  changes?  and  so  on. 
Questions  about  support  for  domain  activities  include:  is  the  user  empowered  to  accomplish  the  task  in 
ways  that  are  both  familiar  yet  efficient  and  effective?  can  the  user  make  productive  use  of  the  tool  to 
accomplish  familiar  tasks?  can  the  user  extend  the  use  of  the  tool  to  new  tasks?  In  my  evaluation  of 
XAIDA,  I  attempted  to  address  these  types  of  issues. 

Shneiderman  (1987)  presents  a  model  of  user  behavior  known  as  the  syntactic/semantic  model.  It 
proposes  two  kinds  of  knowledge:  syntactic,  which  refers  to  precise  and  somewhat  arbitrary  bits  of 
information  about  device-dependent  details,  and  semantic,  which  has  to  do  with  general  concepts 
independent  of  any  particular  system.  The  semantic  knowledge  is  separated  into  task  concepts  and 
computer  concepts;  both  sets  of  concepts  consist  of  objects  and  actions.  In  order  to  effectively  complete  a 
task,  there  needs  to  be  a  mapping  between  semantic  task  and  computer  concepts.  Syntactic  knowledge  is 
obtained  through  rote  memorization,  while  semantic  is  acquired  by  meaningful  learning. 

Kieras and Polson, in their cognitive complexity theory, describe three categories of user
knowledge: job situation, how-to-do-it, and how-it-works (Polson, 1987). These are similar to
Shneiderman's categories, but the labels are somewhat more intuitive to understand. Job situation
knowledge and how-to-do-it knowledge are closely related: the former refers to knowing how to do the
task independent of system characteristics, while the latter is knowing how to use the system to
accomplish the familiar task. How-it-works knowledge is a user's understanding of how the overall
system functions.

These  categories  can  be  applied  to  XAIDA.  How-it-works  deals  with  learning  to  use  the  software 
itself.  How-to-do-it  focuses  on  how  to  use  a  tool  to  accomplish  a  class  of  specific  tasks:  in  this  case, 
designing  a  lesson.  In  the  context  of  XAIDA,  job  situation  knowledge  could  be  one  of  two  things: 
instructional  design  or  knowledge  decomposition.  Most  software  designers  can  assume  that  users  know 
the  underlying  task,  that  is,  that  they  are  familiar  with  the  job  situation  knowledge.  This  assumption 
is  not  appropriate  for  XAIDA,  since  the  purpose  is  to  support  novices  in  the  domain. 

Researchers at IBM, led by John Carroll, have studied people learning to use software and
software manuals (Carroll, 1989; Carroll et al., 1985; Carroll & Mazur, 1984; Carroll et al., 1988). When
using self-study manuals for software training, people often avoid reading because the manuals are too
long, unnecessarily repetitive, and contain too many technical details. Instead, users set up goals for
themselves and explore the system in terms of these goals; that is, they exhibit problem-solving
behavior. Most authors of manuals assume that the reader will follow the directions exactly and never
make any mistakes; because of this assumption, there are very few error recovery procedures in the
manuals. In summary, manuals traditionally are written with the implicit assumption that the
primary goal of the user is to learn the system; the Carroll group found that most users just want to use
the system to accomplish a task (Carroll and Mazur, 1984).

This points precisely to the frustration I experienced with XAIDA. I wanted to accomplish my own
self-selected task, but nothing in the manuals supports such an effort. In addition, the interface and
terminology are hard to decipher, and the input mechanisms are inconsistent and buggy. The software
uses unfamiliar terms, assuming too much domain knowledge. There is no support for error detection
and recovery in the manual. In addition, task support (for design) is very limited, both in the software
and in the manuals. I was able neither to accomplish a task nor to learn the system.

In  short,  the  program  was  difficult  to  use.  Although  my  content  was  specified  beforehand,  and 
although  I  had  already  created  this  lesson  once,  I  was  not  successful  in  building  the  lesson  with 
XAIDA.  In  addition,  I  was  disappointed  with  the  instruction  it  delivered.  I  recommend  that  the 
delivery mechanism be replaced entirely. XAIDA should serve as a knowledge specification tool. It
should  create  well-specified  and  well-designed  "instructional  resource"  objects  that  reside  in  a  library. 
These  resources  could  then  be  accessed  by  the  delivery  mechanism,  according  to  parameters  specifying 
the  learning  situation. 
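
One way to picture this recommendation is as a small library of tagged resources that a separate delivery component queries. The Python sketch below is purely illustrative: the class names, fields, and tags are hypothetical and do not describe XAIDA's actual data structures.

```python
from dataclasses import dataclass, field


@dataclass
class InstructionalResource:
    """One reusable piece of lesson content (hypothetical structure)."""
    title: str
    content: str
    # Free-form tags describing when the resource applies,
    # e.g. {"learner_level": "novice"}.
    situation: dict = field(default_factory=dict)


class ResourceLibrary:
    """Stores resources written by a knowledge specification tool."""

    def __init__(self):
        self._resources = []

    def add(self, resource):
        self._resources.append(resource)

    def select(self, **params):
        """Return resources whose tags match every given parameter."""
        return [
            r for r in self._resources
            if all(r.situation.get(k) == v for k, v in params.items())
        ]


library = ResourceLibrary()
library.add(InstructionalResource(
    "Color code table", "black=0, brown=1, ...", {"learner_level": "novice"}))
library.add(InstructionalResource(
    "Tolerance bands", "gold=5%, silver=10%", {"learner_level": "advanced"}))

# A separate delivery mechanism would query by learning situation.
print([r.title for r in library.select(learner_level="novice")])
```

Keeping specification and delivery on opposite sides of such a library is what would allow the delivery mechanism to be replaced without re-entering lesson content.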

3.0  GAIDA 

3.1  History  of  GAIDA 

From  1991  to  1992,  Dr.  Robert  Gagne  served  the  Laboratory  as  a  National  Research  Council  Senior 
Fellow.  Gagne  is  well-known  for  his  Nine  Events  of  Instruction  (Gagne  et  al.,  1992),  a  model  of  lesson 
delivery  which  has  been  used  with  great  success  and  is  widely  taught  at  universities.  Dr.  Gagne 
envisioned  a  system  where  instructional  design  novices  in  the  Air  Force  would  examine  cases 
representing  good  instructional  design  practices,  cases  built  around  the  Nine  Events.  As  a  result,  the 
novices  would  be  more  effective  in  their  own  instructional  design.  To  fulfill  this  goal,  Gagne  defined  the 
framework  of  GAIDA  and  directed  its  development  at  HRTC.  The  underlying  philosophy  can  be  found 
in  the  following  quote: 

"The  GAIDA  approach  assumes  that  novice  designers  are  capable  of  understanding  concepts 
of  this  sort  in  their  precise  meanings.  The  further  assumption  is  made  that  directions  of  this 
sort  can  be  'followed',  in  the  sense  that  concrete  instances  of  such  an  abstraction  as  'relevant 
knowledge' can be identified and selected from the domain of the equipment data base being
dealt with. ... GAIDA represents an approach to delivery of instruction embodied in a
computer-based lesson on how to design instruction" (Gagné, 1992, p. v).

Initial Development

As  a  result  of  the  earliest  efforts,  GAIDA  consisted  of  three  products. 

The first is a videotape consisting of 3 distinct segments. In the initial segment, Gagné introduces
and explains the Nine Events. After this, a model lesson is presented, showing an instructor teaching
the procedure used for handcuffing. Finally, Dr. Gagné provides voice-over commentary on the
instructional design while replaying clips from the lesson video, describing where each of the Nine
Events is used in the lesson.

In addition to the videotape, two software products have been created using ToolBook. One is a
functional check of the electrical circuits of the gun in the F-16 aircraft, a procedure using a checklist.
The second represents a procedure that must be memorized: conducting a certain pulmonary examination.
In both programs, guidance to the instructional designer is embedded alongside the text of the lesson.
The content of both lessons was specified by Dr. Gagné; programmers included Larry Whitehead and
Steve Hancock. Each of these is a single "book", combining text of a lesson with advice to designers.

In the pulmonary lesson, the user (assumed to be a novice instructional designer) is called upon to
make a duplicate of the lesson content, with the aid of some tools embedded into the lesson. The
intention was to provide an opportunity for practice for these instructional designers.

Along  with  these  finished  products,  a  few  additional  scripts  had  been  written  for  eventual 
integration  into  GAIDA.  The  types  of  learning  represented  are  identification  (naval  ranks),  verbal 
knowledge  (the  history  of  the  Air  Force),  and  rule  learning  (simplifying  algebraic  expressions). 

Early  Evaluations 

The  F-16  lesson  has  been  examined  by  subject  matter  experts  and  instructional  designers  in  a  formative 
evaluation (Gagné, 1992). Results indicated that users found the level of guidance to be meaningful and
helpful.  Users  had  no  difficulties  in  understanding  the  instructions,  and  were  able  to  use  them  to  design 
their  own  lessons.  They  considered  the  resulting  lessons  to  be  satisfactory  for  training  new  specialists. 

At a more informal level, I assessed the programs both in terms of usability and content. In my
opinion,  while  the  ideas  behind  these  products  are  sound,  the  initial  implementation  fell  somewhat 
short  of  intentions.  Readers  become  confused;  it  is  hard  to  distinguish  between  the  lesson  and  the 
commentary  on  the  lesson,  for  they  are  closely  intertwined.  In  addition,  the  practice  contained  in  the 
pulmonary  example  is  both  awkward  and  unrealistic.  It  is  not  a  real  design  situation;  instead,  the  user 
merely  copies  and  pastes  from  one  book  into  another.  Better  support  tools  could  be  created  in  ToolBook; 
however,  this  authoring  program  is  not  used  extensively  by  Air  Force  designers,  so  this  type  of  practice 
may  be  superficial  at  best.  Instead,  it  seemed  more  appropriate  to  redesign  the  case  material 
presentation.

3.2 GAIDA: Redesign

After  identifying  some  of  the  problems  associated  with  the  original  computer-based  implementations 
of GAIDA, and comparing these with the videotape component, I became convinced that a new model
was  needed.  Fortunately,  there  was  agreement  among  other  members  of  the  branch  about  this  need.  As  a 
result,  I  proposed  that  a  new  model  be  used  to  direct  the  implementation  of  GAIDA.  I  was  primarily 
responsible  for  designing  this  new  model. 

In  contrast  to  the  two  software  portions  of  GAIDA,  I  felt  that  the  videotape  represented  a  good 
model  that  could  also  be  implemented  in  software.  In  addition,  practice  should  be  provided  in  the 
design  and  planning  of  a  lesson,  not  in  the  use  of  any  particular  authoring  environment  or  computer 
program.  The  latest  model  for  the  GAIDA  architecture  is  shown  in  Figure  1. 

In  this  model,  a  GAIDA  case  consists  of  two  parts:  a  stand-alone  lesson  (which  could  be  viewed 
independently  of  GAIDA),  as  well  as  instructional  design  guidance  related  to  that  lesson.  These  can  be 
referred to as "the lesson" and "the design commentary". Users of GAIDA would first view information
about the Nine Events of Instruction; after this, they would choose from a list of cases. While
viewing  the  case,  users  would  read  commentary  on  how  the  lesson  was  designed  and  how  it  incorporated 
the  Nine  Events;  in  addition,  they  could  switch  to  the  actual  lesson  to  experience  the  event  in  the 
context  of  the  full  lesson. 
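
The two-part case structure can be sketched as a small data type. The Python below is an illustration under stated assumptions, not the ToolBook implementation: the class, field names, and sample values are hypothetical, while the event names follow the standard published list of Gagné's Nine Events.

```python
from dataclasses import dataclass

# Gagné's Nine Events of Instruction, numbered 1-9.
NINE_EVENTS = (
    "Gaining attention",
    "Informing the learner of the objective",
    "Stimulating recall of prior learning",
    "Presenting the stimulus",
    "Providing learning guidance",
    "Eliciting performance",
    "Providing feedback",
    "Assessing performance",
    "Enhancing retention and transfer",
)


@dataclass
class GaidaCase:
    title: str
    lesson_pages: dict   # event number -> location in the stand-alone lesson
    commentary: dict     # event number -> design commentary text

    def view_event(self, number):
        """Return the event name, its commentary, and the lesson location."""
        name = NINE_EVENTS[number - 1]
        return (name,
                self.commentary.get(number, ""),
                self.lesson_pages.get(number))


# Hypothetical case content
case = GaidaCase(
    title="Resistor color code",
    lesson_pages={6: "practice page", 7: "feedback panel"},
    commentary={6: "Two practice opportunities are provided."},
)
print(case.view_event(6))
```

Because the lesson locations are stored separately from the commentary, the lesson can still be viewed on its own, as the design requires.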

3.3 GAIDA: Implementation

After  developing  a  new  design,  I  implemented  the  materials  for  one  GAIDA  case. 

Stand-alone  lesson  on  resistors 

Based  on  the  advice  of  Dr.  Gagne,  my  first  step  in  developing  a  case  was  to  design  the  stand-alone 
lesson. Such a lesson represents the core of the case. Classification was chosen as the learning type in
order to round out the suite of lessons. In a classification task, the learner first makes a discrimination
or judgement, and then takes an action based on that decision. We determined to use a concept in basic
electronics, that of converting the color code on a resistor into its numeric equivalent. One resulting
action could be to set the range switch on an ohmmeter to ensure accurate readings. The content for this
lesson came from two sources: the course materials for the Common Electronic Training Program (CETP),
which is conducted at the USAF Technical Training School at Keesler Air Force Base by the 3410
Technical Training Group, and various electronics textbooks (Horn, 1990; Loper, Ahr, & Clendenning,
1979).  The  plan  of  instruction  and  course  chart  in  the  Air  Force  course  materials  provided  a  baseline  for 
determining  prior  knowledge  (Event  3  of  the  Nine  Events)  and  experience  of  the  target  audience. 

Figure 1: GAIDA Design

Although the content of my lesson was similar to that found in the course materials and the
textbooks, it differed in at least two significant ways: through the explicit incorporation of the Nine
Events of Instruction at the design stage, and by the varied range of practice opportunities afforded by
the computer. A textbook contains a limited number of problems. Typically, the problems take the
following form: "Suppose you have a resistor with the following markings: brown, black, yellow, and gold;
what is the value of the resistor?". Textbooks are usually black and white, so if an image appears with
the problem statement, it is black and white with names of colors labelling the bands. With the
computer-based lesson, these and other limitations can be avoided.

To achieve Event 6, I provided two opportunities for practice in my lesson. In the first, the trainee
is shown a resistor with colored bars. The trainee enters the numeric equivalent into a field. Feedback
(Event 7) provides an evaluation of the correctness of the response along with the correct answer.
Trainees can generate as many problems as they like by clicking on a button on the screen. When they
are satisfied with their ability to solve the problems, they can go to the next page. The values for the
resistors are generated randomly, so each use of the program will contain different problems.
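
The first practice activity can be illustrated with a short sketch. It is not the code used in the lesson; it assumes the standard four-band color code (two digit bands, a multiplier band, and a gold or silver tolerance band), and the names `decode`, `random_problem`, and `check_answer` are hypothetical.

```python
import random

# Standard digit colors; the list index is the digit (black = 0 ... white = 9).
COLORS = ["black", "brown", "red", "orange", "yellow",
          "green", "blue", "violet", "gray", "white"]
TOLERANCE = {"gold": 5, "silver": 10}  # tolerance band, in percent


def decode(bands):
    """Return (ohms, tolerance_percent) for a four-band resistor."""
    first, second, multiplier, tolerance = bands
    digits = 10 * COLORS.index(first) + COLORS.index(second)
    return digits * 10 ** COLORS.index(multiplier), TOLERANCE[tolerance]


def random_problem(rng=random):
    """Generate a random four-band resistor for one practice item."""
    return (rng.choice(COLORS[1:]),      # first digit is never zero
            rng.choice(COLORS),
            rng.choice(COLORS[:7]),      # multiplier limited to 10^0 .. 10^6 (assumption)
            rng.choice(list(TOLERANCE)))


def check_answer(bands, answer_ohms):
    """Feedback in the spirit of Event 7: (is_correct, correct_value)."""
    correct, _ = decode(bands)
    return answer_ohms == correct, correct


# The textbook problem quoted above: brown, black, yellow, gold.
print(decode(("brown", "black", "yellow", "gold")))
```

For the textbook example, brown and black give the digits 1 and 0, and yellow multiplies by 10^4, so the value is 100,000 ohms with a 5 percent tolerance.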

The  second  practice  opportunity  requires  the  reverse  application  of  the  rule.  The  computer 
randomly  generates  a  resistance  value  and  the  user  is  required  to  "color  in"  the  bands  on  a  resistor  to 
match  that  number.  Feedback  takes  the  form  of  right  or  wrong,  along  with  a  hint  as  to  which  band  is  in 
error.  Again,  users  are  allowed  to  generate  as  many  problems  as  they  wish  to  view.  In  both  settings, 
hints  are  available  upon  user  request;  these  hints  consist  of  the  shortcut  for  the  color  code  along  with  a 
small  image  which  names  the  bands  of  the  resistor. 
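
The reverse activity, including the hint about which band is in error, might be sketched as follows. This is an illustration under the same standard color-code assumption, not the lesson's implementation; `encode` and `feedback` are hypothetical names, and only values expressible as two significant digits times a power of ten are handled.

```python
COLORS = ["black", "brown", "red", "orange", "yellow",
          "green", "blue", "violet", "gray", "white"]


def encode(ohms):
    """Return [first digit, second digit, multiplier] colors for a resistance."""
    exponent = 0
    # Strip trailing zeros until two significant digits remain.
    while ohms >= 100 and ohms % 10 == 0:
        ohms //= 10
        exponent += 1
    if not 10 <= ohms <= 99 or exponent > 9:
        raise ValueError("value cannot be shown with two digit bands")
    return [COLORS[ohms // 10], COLORS[ohms % 10], COLORS[exponent]]


def feedback(target_ohms, chosen_bands):
    """Return (is_correct, positions of wrong bands), counting bands from 1."""
    expected = encode(target_ohms)
    wrong = [i + 1 for i, (e, c) in enumerate(zip(expected, chosen_bands)) if e != c]
    return not wrong, wrong


# A trainee colors 4700 ohms as yellow, violet, orange: the third band is wrong.
print(feedback(4700, ["yellow", "violet", "orange"]))
```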

For Event 8 (Assess Performance), there are two testing situations. In the first, the user drags a
series of resistors into the correct order. Feedback indicates which are in error; the student is not allowed to
progress until all are in the correct order. The second situation requires the user to set a range on an
ohmmeter based on the value of a resistor.

3.4  Other  GAIDA  work 

Instructional  Design  guidance  for  resistor  lesson 

After specifying the content for the resistor lesson, I created the GAIDA commentary. The
commentary consists of a description of how each of the Nine Events of Instruction was incorporated into
the lesson. Particular attention is paid to the demands of a classification lesson.

GAIDA  driver  design  document 

My implementation used the model developed at the beginning of the summer. Another case, the
naval rank lesson, was simultaneously implemented by Mr. Larry Whitehead during the summer.
Problems of integration and standardization became apparent. Thus, near the end of my tenure, the
design was further refined, and I wrote a design document for this new model. (This is the design shown
in Figure 1.)

Instructional  design  guidance 

Finally, I wrote (and extracted from other documents) the instructional design advice for the four
modules which will be incorporated into GAIDA.

3.5 GAIDA: Future Work


Shortcomings 

The stand-alone resistor lesson has not been evaluated in terms of ease of use or content coverage.
However, since its primary purpose is to serve as a design example and not as actual instruction, this
task is not a priority. It is more important to evaluate the commentary on the lesson.

Similarly,  the  GAIDA  interface  has  not  received  user  testing.  A  help  function  has  not  been 
implemented.  Testing  may  result  in  further  refinements,  such  as  a  different  mechanism  for  displaying 
the  sample  lesson,  or  may  point  out  a  need  for  additional  features,  such  as  a  notebook. 

Currently, GAIDA does not provide an opportunity for novice designers to practice. Appropriate
practice would be to use the Nine Events (e.g., to design or plan a lesson). We intend to incorporate an
environment wherein the users can 'sketch out' their design by describing the text and graphic content
of their lesson, the lesson flow, etc.

Future  Work 

In addition to usability evaluations, GAIDA should be evaluated as to its effectiveness. One approach
would be to compare designers who use this case-based method to those who have access to the Gagné,
Briggs, and Wager book, or to those who have received formal training on the ISD model. The groups
would be required to design a lesson on some specified material. The designs would be evaluated in
terms of quality by blind reviewers. We would hypothesize that those using GAIDA would develop
better overall designs.

Variations on this study include matching pairs in terms of subject-matter expertise (e.g.,
maintenance, electronics, and so on) across conditions, and having subjects design a lesson for a learning type
found in GAIDA and one not found in GAIDA. Another potential hypothesis is that GAIDA users might
be more efficient designers.

The  Armstrong  Laboratory  is  planning  to  conduct  these  studies  in  collaboration  with  Dr.  Gros  of  the 
University  of  Barcelona  in  1993. 

4. Summary

XAIDA  and  GAIDA  represent  different  approaches  to  advising  instructional  designers.  XAIDA 
attempts  to  shield  Air  Force  designers  from  the  need  to  know  principles  of  instructional  design.  GAIDA, 
on  the  other  hand,  presents  the  designers  with  a  particular  model  and  expects  them  to  understand  it 
and  incorporate  it  into  their  own  lessons.  Furthermore,  XAIDA  represents  a  particular  theory  of 
learning  and  knowledge  structure;  GAIDA,  on  the  other  hand,  represents  a  model  of  lesson  delivery. 

While they are difficult to compare directly, it is possible to use them to shed light on one
another's strengths and shortcomings. It is my opinion that the lessons delivered by XAIDA are lacking
many of the features advised in GAIDA. At best, XAIDA only focuses on the middle events. In addition, the
delivery mechanism is severely limited. It is tied directly to the method used to input the knowledge,
although some customization via parameters is possible. XAIDA does not advise at all about
parameters. GAIDA, on the other hand, may be too optimistic: it provides advice but not really
support, and support may be more essential for some personnel.

The intention of AIDA is to combine a variety of tools in order to provide support for novice
instructional designers. The complementary strengths and weaknesses of XAIDA and GAIDA reinforce
the need for such a structure. XAIDA focuses on a subset of the events of instruction: those of presenting
the material and providing learning guidance. Instructional designers will need advice on the other
aspects of delivering a lesson; thus GAIDA will play a role. As a minimum, what is still needed is a
good delivery mechanism.

During  the  course  of  my  summer  research  tenure,  I  was  able  to  apply  a  broad  range  of  my  own 
expertise.  I  was  able  to  combine  knowledge  of  human  factors  and  interface  issues  with  instructional 
design  expertise  in  order  to  evaluate  XAIDA.  Furthermore,  I  again  used  my  instructional  design 
expertise  as  well  as  my  skills  in  software  development  to  refine  the  design  of  GAIDA,  implement  a  case 
for it, and write instructional design commentary for other cases.

The enhancement of the operational envelope of performance aircraft has led to intensified
physiological stresses being applied to the aviator. Much emphasis has been placed on combating and
understanding the effects of sustained, elevated +Gz stress. While Combat EDGE has sought to extend the
level of G tolerance, the G-LOC effort has attempted to understand the mechanisms of gravity induced loss
of consciousness. A more complete analysis of the autonomic nervous system (ANS) under gravitational
stress may aid in enhancing operator capability in hostile environments, and contribute to the G-LOC and
Combat EDGE efforts. Computer algorithms were programmed using LabVIEW on a Macintosh IIfx
computer to assess ANS regulation of biological variability (heart rate, blood pressure, and respiration).
Implementation of these algorithms will include the assessment of biological variability in individuals during
varying workloads, an orthostatic tolerance test, and potentially during human centrifuge studies.

THE ASSESSMENT OF
BIOLOGICAL VARIABILITY

Etfc<rtCW%K«»<Mdl 

INTRODUCTION

During a hostile engagement, the modern aviator may be required to operate under sustained, high
+Gz acceleration. With a normalized curve of G tolerance among the aviator population (10), some aviators
may not be able to compensate for the sustained gravitational stress. The next generation of aircraft are
destined to enhance the magnitude and duration of +Gz acceleration. Certainly, engagements can ferret out
those that have the capability to achieve excellence and withstand the ever increasing physiological stressors.
However, such a Darwinian selection process would be costly to both man and machine.

In an effort to reduce mortality and maximize performance, the United States military has a long
standing tradition of attempting to reduce the medical risk to aviators. Certain abnormalities are cause for
rejection from flight training or removal from high performance aircraft. Yet for all the effort expended,
there still appears to be an unacceptable loss of aviators and aircraft.

To maximize aviator survivability and performance, some of the Armstrong Laboratory Crew
Technology Division research efforts have focussed on Combat EDGE, G-LOC, and spatial disorientation.
Of potential importance to the Combat EDGE and G-LOC efforts is how the Autonomic Nervous System
regulates the peripheral circulation during gravitational stress. Although quite contested, it has been
suggested that individuals who are highly conditioned may exhibit an inadequate response to +Gz and have
altered HR variability under varying workloads (13).

Without countermeasures a relaxed individual will pass out at +5 Gz (10). However, the current
generation of countermeasures can provide sustained protection up to +9 Gz (10). Regardless of the
countermeasures involved, certain individuals are more susceptible to G-LOC than others, leading to a basic
question of "What factors imbue an individual with G tolerance?"

Heart rate variability is an appropriate place to initiate a comprehensive study of an individual's
ANS, as it should theoretically be a sensor of all impingements on the circulatory system. Standards for the
recording of an ECG signal have existed for over 50 years (2,12,19,25). However, there has been relatively
little standardization of the newest methodologies such as signal averaged ECG, and the assessment of heart
rate variability. The purpose of this Summer research project was to develop an understanding of the
current methods of HR variability analysis and develop the software to implement the findings at AL/CFTF
for the assessment of HR, blood pressure, and respiratory variability.

BACKGROUND

Biological Variability

Although often described as beating in a normal rhythmic pattern, giving rise to such terms as
normal sinus rhythm, the human heart exhibits a quantifiable beat to beat variability. There is also variability
throughout the day, referred to as the circadian rhythm. By quantifying the standard deviation of the R-R
intervals, some insight can be gained into the variable nature of the heart beat intervals. The use of modern
mathematical tools has led to the ability to map the time domain signal into the frequency domain. It is the
frequency domain that has provided the most insight into the beat to beat variability of the heart.

Data in the frequency domain is typically analyzed in terms of a power spectrum. Total power for
R-R interval variability is measured as the integral from 0.01 Hz to 0.4 Hz. Power is also measured in three
specific frequency ranges: 0.01-0.04 Hz, 0.04-0.15 Hz, and 0.15-0.4 Hz. These frequency ranges are
influenced by neurohumoral influences, sympathetic plus parasympathetic influences, and parasympathetic
influences, respectively.
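
As an illustration of these two measures, the following sketch computes the standard deviation of the R-R intervals and integrates a power spectrum over the three ranges. It assumes NumPy and a spectrum already available as arrays of frequencies and power values; the band labels and function names are hypothetical and do not come from the report.

```python
import numpy as np

# Frequency ranges in Hz as given in the text; the labels are illustrative only.
BANDS = {"low": (0.01, 0.04), "mid": (0.04, 0.15), "high": (0.15, 0.40)}


def rr_standard_deviation(rr_ms):
    """Sample standard deviation of the R-R intervals (same units as the input)."""
    return float(np.std(np.asarray(rr_ms, dtype=float), ddof=1))


def band_power(freqs, psd, lo, hi):
    """Integrate the spectrum between lo and hi (Hz) with the trapezoid rule."""
    freqs = np.asarray(freqs, dtype=float)
    psd = np.asarray(psd, dtype=float)
    mask = (freqs >= lo) & (freqs <= hi)
    f, p = freqs[mask], psd[mask]
    return float(np.sum(0.5 * (p[1:] + p[:-1]) * np.diff(f)))


def band_summary(freqs, psd):
    """Total power (0.01 to 0.4 Hz) and the power in each of the three ranges."""
    summary = {name: band_power(freqs, psd, lo, hi) for name, (lo, hi) in BANDS.items()}
    summary["total"] = band_power(freqs, psd, 0.01, 0.40)
    return summary
```

With a flat spectrum of 1 unit per Hz, each band power is simply the band width, which is a convenient sanity check before applying the functions to recorded data.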

Orthostatic tolerance is clinically assessed using a standing test or a tilt table test. This test is
designed to evaluate an abnormal response to a shift in body fluids. Blood pressure and heart rate are
assessed while the subject is in a supine position and again while in the upright position. A drop in systolic
pressure of greater than 25 mmHg or a diastolic drop of greater than 10 mmHg is abnormal (1). These
changes in pressure are primarily due to a drop in peripheral resistance (23). A variety of factors impinge on
the regulation of blood pressure upon standing, including the baroreceptors, catecholamines, the
renin-angiotensin-aldosterone system, and vasopressin release (1). All of these factors can have an influence upon
the heart rate in addition to the BP. However, heart rate may not always be altered.


The measurement of HR and BP variability should not be construed as the only method to assess
the ANS. Many drugs influence ANS performance. Various neuropathies can also severely impinge on the
adequate functioning of the ANS. Therefore, neurologists have an extensive protocol to establish the
location of ANS disorders. Such a neurological profile may need to be considered to gain a more accurate
representation of who may have reduced G tolerance.

ECG Signal

The electrical signal recorded and referred to as the electrocardiogram is a function of the
depolarization and repolarization of the heart. Two types of standard ECGs are typically conducted: a 3 lead
or a 12 lead. For heart rate variability analysis, the lead most typically used is the chest lead (Lead II).
Before the electrical activity is recorded, it follows a treacherous path through many noise producing agents
and signal strength reducers. The tissues, such as lung, bone, muscle, and fat, act as resistors which serve to
attenuate the signal and change its form. Intense muscular activity, power surges, and 60 Hz interference
can all cause an increase in the noise picked up by the electrodes (8). If there is not a solid connection,
additional interference is encountered.

BP Signal

A noncontinuous BP measurement can be made using a sphygmomanometer cuff and listening for
the sound of blood. Additional approaches exist for the continuous measurement of blood pressure. An
indwelling arterial catheter can be used to get a continuous reading, although distortion from air bubbles and
"catheter whip" are possible (24). Another approach is to use a noninvasive device to measure arterial
pressure. This is the basis for the Finapress device, which uses a fingertip BP monitor.

Analog Prefilter

The purpose of prefiltering the analog signals in biological variability analysis is to avoid the
introduction of errors due to aliasing. The ideal prefilter for variability studies should have minimal
attenuation in the frequencies of interest (passband, 0 to 125 Hz) (25), and create a large signal attenuation
at high frequencies. The cutoff and stopband are two critical frequencies in the design of a prefilter. The
cutoff frequency (Fc) is the point where signal attenuation is initiated (an attenuation of -3 dB). The
stopband frequency (Fs) is where attenuation is typically greater than -70 dB. The ratio of stopband
frequency to cutoff frequency is often calculated to give a measurement of the rolloff from passband to
stopband. In a lowpass filter, the passband region is before the cutoff frequency. In the passband region,
minimization of ripples is important.

There are currently four popular types of analog filters: Butterworth, Chebyshev, Elliptic, and
Bessel. The Butterworth offers a maximally flat passband, while the Chebyshev has the steepest transition
from passband to stopband. The Bessel filter has a maximally flat time delay. The Butterworth filter trades
off everything else for maximal flatness. However, the Butterworth has less than ideal phase characteristics.
If 1 dB ripples in the passband are acceptable, then a Chebyshev filter can be used. It allows greater
sharpness at the knee (3 dB), but has less than ideal phase characteristics. However, no matter what design
is employed, flatness of the passband is never really possible. Design criteria call for very tight tolerances
of the resistors and capacitors (1%), but some ripple still occurs. The elliptic or Cauer filter is a variation of
the Chebyshev filter and is easy to implement with CAD. The elliptic filter is indicated where the shape of
the waveform is important.
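
The relationship between the cutoff frequency, the stopband frequency, and the steepness of the rolloff can be made concrete for the Butterworth case. The sketch below uses the standard textbook formula for the minimum Butterworth order; it is offered as an illustration, not as the design procedure used at AL/CFTF, and the function name and example frequencies are assumptions.

```python
import math


def butterworth_order(fc_hz, fstop_hz, stop_atten_db, pass_atten_db=3.0):
    """Minimum Butterworth order that is pass_atten_db down at fc_hz
    and at least stop_atten_db down at fstop_hz (lowpass design)."""
    ratio = (10 ** (stop_atten_db / 10) - 1) / (10 ** (pass_atten_db / 10) - 1)
    return math.ceil(math.log10(ratio) / (2 * math.log10(fstop_hz / fc_hz)))


# Hypothetical example: 125 Hz cutoff, 70 dB of attenuation required by 500 Hz.
print(butterworth_order(125, 500, 70))
```

Halving the stopband-to-cutoff ratio roughly doubles the required order, which is one reason the ratio Fs/Fc is a useful single figure for describing a prefilter.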

Sampling Frequency

The sampling frequency is determined by multiplying the stopband of the prefilter by 2. This value
is then multiplied by the number of channels to get the total sampling rate. Sampling at twice the stopband
frequency allows for 1 bit resolution to be achieved if the stopband is at -72 dB. To simplify the calculation
of storage space requirements, the sampling rate should also be a power of two.
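
A minimal sketch of this calculation follows. It assumes that the power-of-two rounding is applied to the per-channel rate before multiplying by the number of channels, which the text does not state explicitly; the function name and example values are hypothetical.

```python
def sampling_rates(stopband_hz, channels):
    """Return (per_channel_rate, total_rate) in Hz.

    The per-channel rate is twice the prefilter stopband frequency,
    rounded up to the next power of two (an assumption about where the
    rounding is applied).
    """
    minimum = 2 * stopband_hz
    rate = 1
    while rate < minimum:
        rate *= 2
    return rate, rate * channels


# Hypothetical example: 500 Hz stopband, three channels (ECG, BP, respiration).
print(sampling_rates(500, 3))
```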

Through the use of signal decimation techniques, it is possible to alter the sampling rate. For a
signal sampled at 256 Hz and 1024 Hz, the errors for sampling a 100 BPM heart rate signal range from 2.34
to 0.39 msec, respectively. However, it is possible that both sampling rates may give the same information
about the heart rate intervals. The only way to assess this is by doing a signal to noise analysis.

Pizzuti et al. did an analysis of sampling rate and ECG using a system with ±9 bits accuracy (20).
Sampling at 250 Hz rather than 500 Hz is acceptable for large interval measurements such as the R-R interval
(20). However, sampling at 125 Hz introduced significant errors in interval measurements (20). Merri et al.
studied signal to noise ratios of R-R interval measurements at various sampling frequencies (16). The
highest quality signal was at 256 Hz, followed by sampling at 128 Hz and 64 Hz (16), again supporting the
contention of sampling at greater than 250 Hz. It was suggested that the more variable the R-R intervals are,
the greater the noise and therefore the higher the sampling frequency required.

To ensure the integrity of the data analysis, it is appropriate to conduct a signal to noise test on each
set of data. This will ensure that no errors are introduced in the analysis through improper sampling rate
selection.

Peak Detection Algorithm

Once an analog ECG signal has been pre-filtered and converted into digital form by sampling at an
appropriate rate, the R peaks of each QRS complex must be detected. Over thirty seven different
methodologies have been published concerning the detection of peaks (8). One of the simplest
approaches to use, if the ECG signal has a stable baseline, is a peak threshold detection of a negative
derivative. When the derivative signal crosses the zero line (corresponding to an R peak) the peak detector
records the time of occurrence. If the data is unstable or not free of noise, this will not work accurately.
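
A simple detector of this kind can be sketched as follows. It assumes NumPy, a clean signal with a stable baseline, and an amplitude threshold chosen by the user; it is an illustration of the derivative zero-crossing idea rather than one of the published algorithms, and the function name is hypothetical.

```python
import numpy as np


def detect_r_peaks(ecg, fs, threshold):
    """Return the times (s) of samples where the first difference changes from
    positive to non-positive and the signal exceeds the amplitude threshold."""
    ecg = np.asarray(ecg, dtype=float)
    d = np.diff(ecg)
    # d[i] > 0 and d[i + 1] <= 0 marks a local maximum at sample i + 1.
    candidates = np.where((d[:-1] > 0) & (d[1:] <= 0))[0] + 1
    peaks = candidates[ecg[candidates] > threshold]
    return peaks / fs


# Synthetic check: three sharp peaks and one small bump below the threshold.
signal = np.zeros(1000)
for i in (100, 350, 600):
    signal[i - 1:i + 2] = [0.5, 1.0, 0.5]
signal[800] = 0.2
print(detect_r_peaks(signal, fs=250, threshold=0.6))
```

Differences between successive peak times give the R-R intervals. As the text notes, noise or baseline drift will produce spurious zero crossings, so this approach is only a starting point.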

Friesen presents two approaches that appear to be very appropriate for the analysis of the ECG
during centrifugation studies (8). The first QRS detector, referred to as AF1, uses an algorithm that includes
the first derivative and amplitude in the analysis (8). This method is exceptional at the detection of a QRS
complex even with high levels of EMG noise. The second filter is referred to as DF1 and uses digital
filtering methods (8). This method is a good choice for peak detection under a variety of noisy
environments. The reader is referred to Friesen et al. for a more detailed discussion of QRS detection (8).

Instantaneous Heart Rate Determination

Although the heart rate power spectrum can be computed using just the R-R intervals, this tends to
create errors in the spectrum. To obtain a more precise power spectrum representation, the heart rate needs
to be sampled at regularly spaced intervals; in other words, as an instantaneous heart rate. Five methodological
approaches exist for the determination of instantaneous heart rate (5, 6). Factors to be considered include
correcting for the delay encountered in some of the methods and the errors induced by each methodological
approach. De Boar et al. present three methods: spectrum of intervals, spectrum of counts, and
inverse intervals (6). Berger et al. present two methods (5). The first method involves taking the reciprocal
of the R-R interval duration and passing it through a low pass filter window with a width of 2 times the
sampling interval.

An equivalent approach to determine instantaneous rate is to let:

$R = f_r n_i / 2$

where $R$ is the instantaneous rate, $f_r$ is the sampling rate, and $n_i$ is the sum of all fractions of
intervals covered by that window (3).
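
The windowed counting approach can be sketched in a few lines. The sketch assumes that the window spans one sampling interval on either side of each sample instant, so that the rate is the sampling rate times the number of R-R intervals (including fractions) inside the window, divided by 2. NumPy is assumed and the function name is hypothetical.

```python
import numpy as np


def instantaneous_rate(beat_times, fr=4.0):
    """Resample heart rate (beats/s) at fr Hz from R peak times in seconds."""
    beat_times = np.asarray(beat_times, dtype=float)
    starts, ends = beat_times[:-1], beat_times[1:]
    durations = ends - starts
    # Keep only sample instants whose window lies inside the recorded beats.
    first = int(np.ceil(beat_times[0] * fr)) + 1
    last = int(np.floor(beat_times[-1] * fr)) - 1
    t = np.arange(first, last + 1) / fr
    rates = np.empty(len(t))
    for k, tk in enumerate(t):
        lo, hi = tk - 1.0 / fr, tk + 1.0 / fr
        # Length of each R-R interval that falls inside the window.
        overlap = np.clip(np.minimum(ends, hi) - np.maximum(starts, lo), 0.0, None)
        n_i = np.sum(overlap / durations)   # number of intervals, with fractions
        rates[k] = fr * n_i / 2.0
    return t, rates


# A perfectly regular 0.8 s R-R interval gives 1.25 beats/s (75 BPM) throughout.
t, r = instantaneous_rate(np.arange(11) * 0.8, fr=4.0)
print(r[:3] * 60)
```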

Often data from other biological signals are gathered in conjunction with the ECG. These have
included such factors as blood pressure and respiration. These signals present a unique processing challenge
when compared to that of the heart rate signal. Typically, sampling would need to be reduced to 4 Hz for
ECG, BP, and respiration. This would allow us to evaluate the interval from d.c. to 2 Hz. However, Berger
et al. suggest that the power spectrum is only accurate to the sampling frequency divided by 4 (5). This
would produce a spectrum from 0 to 1 Hz. The major dilemma, regardless of the final width of the power
spectrum desired, is the manyfold reduction in sampling rate. In the case of the current protocol the signal
was reduced from a sampling rate of 1024 Hz to 4 Hz, a factor of 256. Digital filtering techniques are
necessary when the sampling rate of biological signals is reduced by such a magnitude.

Decimation is downsampling by lowpass filtering followed by compression (17). Berger et al. use
a 31 point low pass filter followed by a compression of a factor of 4. This approach will be used to go from
1024 Hz -> 256 Hz -> 64 Hz -> 16 Hz -> 4 Hz. An ideal low pass filter is of the form (17):

$h_{lp}(n) = \sin(\omega_c n) / (\pi n)$
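
The cascade of four decimate-by-4 stages can be sketched as below. The sketch assumes NumPy, takes the ideal lowpass response sin(w_c n)/(pi n) truncated to 31 points, and applies a Hamming window to the truncated response; the window choice, the cutoff at the new Nyquist frequency, and the function names are assumptions and are not taken from the report.

```python
import numpy as np


def lowpass_taps(num_taps=31, cutoff=0.125):
    """Windowed-sinc lowpass filter; cutoff is in cycles per input sample.

    0.125 is the Nyquist frequency after compressing by 4 (assumed cutoff).
    """
    n = np.arange(num_taps) - (num_taps - 1) / 2
    # 2*fc*sinc(2*fc*n) equals sin(w_c n) / (pi n) with w_c = 2*pi*fc.
    h = 2 * cutoff * np.sinc(2 * cutoff * n)
    h *= np.hamming(num_taps)          # taper the truncated response (assumption)
    return h / h.sum()                 # unity gain at d.c.


def decimate(x, stages=4):
    """Apply `stages` rounds of lowpass filtering followed by keeping every 4th sample."""
    taps = lowpass_taps()
    y = np.asarray(x, dtype=float)
    for _ in range(stages):
        y = np.convolve(y, taps, mode="same")[::4]
    return y


# Ten seconds at 1024 Hz reduce to 40 samples at 4 Hz.
print(len(decimate(np.ones(10240))))
```

The first and last few output samples are affected by the filter edges and would normally be discarded.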

Computing the Power Spectrum

The power spectrum can be computed after the biological signal is in digital form and preferably
converted to an instantaneous sampling rate. Nine methods exist for the generation of power spectrums
from time domain signals (11). One of these approaches has three variations, bringing the total to eleven
possible methods with their own distinct advantages and drawbacks (11). The power spectrum is an
assessment of the relative contribution of each frequency. For instance, if a signal repeats at
exactly a 2 Hz period, such as a sine wave, a single sharp peak will appear. However, if a signal is composed
of repeating components of various periods of duration, multiple peaks are possible.

The generation of a power spectrum is typically based on two approaches: the autoregressive (AR)
approach and the fast Fourier transform (FFT) based approaches. Press et al. provide C algorithms for both
approaches (21). The AR approach has the advantage over the FFT in that it does not require a power of
two for the number of samples (11). Additionally, fewer samples are required (11). On the other hand, more
care must be employed in the use of the AR approach. The proper order of the AR approach can be
determined using the Akaike Information Criteria (AIC) (15). The easiest method is the FFT based
periodogram approach. Minimal computational and software design strain is necessary to generate the
power spectrum. A variation on this theme is the Blackman-Tukey approach, employed by Cohen's group
(5). A practical application of the AR approach is in Bartoli et al. (5) and an additional algorithm is in
Marple (15).
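
The FFT based periodogram is short enough to show in full. The sketch below assumes NumPy and an evenly sampled signal (for example, an instantaneous heart rate series at 4 Hz); the one-sided scaling shown is one common convention, and the function name is hypothetical.

```python
import numpy as np


def periodogram(x, fs):
    """One-sided periodogram of an evenly sampled signal.

    Returns (freqs in Hz, power in signal units squared per Hz).
    """
    x = np.asarray(x, dtype=float)
    x = x - x.mean()                       # remove the d.c. component
    n = len(x)
    power = np.abs(np.fft.rfft(x)) ** 2 / (fs * n)
    # Fold the negative frequencies into the positive ones.
    if n % 2 == 0:
        power[1:-1] *= 2
    else:
        power[1:] *= 2
    return np.fft.rfftfreq(n, d=1.0 / fs), power


# Five minutes of a 0.25 Hz oscillation sampled at 4 Hz.
t = np.arange(1200) / 4.0
freqs, power = periodogram(np.sin(2 * np.pi * 0.25 * t), fs=4.0)
print(freqs[np.argmax(power)], freqs[1])
```

The spacing between frequency bins is the reciprocal of the record length, here 1/300 Hz for a five minute record.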

Presentation of Power Spectrum Results

After the selection of a method for computing the power spectrum, the results must be presented in
an understandable format. On a two dimensional graph of the power spectrum, the frequency is along the
x-axis. Alternatively, the x-axis may be in units of cycles/beat if the De Boar approach is followed. This
may then be converted to equivalent Hertz by multiplying the x-axis values by the reciprocal of the mean
R-R interval. The values along the y-axis are unitless for normalized data and in units of msec^2/Hz if not
normalized. If the De Boar method is used the units are msec^2/cycles/beat. Normalization of the power
spectrum occurs when the power spectrum values are divided by the mean of the interval squared. The next
factor to consider is whether the amplitude or power spectrum is being plotted on the y-axis. Amplitude is
simply the square root of the power spectrum.

Once the various methodologies are sorted out, it is necessary to interpret the power spectrum
density plots. Of considerable importance is whether the subjects were under controlled breathing or free
breathing. Controlled breathing at a regularly spaced interval will cause a large spectral peak at the
frequency of breathing, whereas free breathing causes a more diffuse peak. This points out the strong
influence of vagal innervation on heart rate.

Resolution of the spectral plot is very important. The maximum resolution is a function of 1/T,
with T equaling the period. For 5 minutes the resolution would be 1/300 Hz or .0033 Hz. A longer period
is necessary to generate smaller interval values. Thus, resolution is enhanced significantly as the period is
increased.

Before the methods section is presented, two of the published approaches to the assessment of
biological variability will be presented. Additional approaches are also available (4, 14, 22).

De Boar Method of Power Spectrum Analysis (7)

1. For a period of 512 beats the R-R interval duration (I), Systolic Pressure (S), Diastolic Pressure (D),
Mean Pressure (M), and Pulse Pressure (P) will be determined.

2. Determine the means for the above variables.

3. Subtract the d.c. component.

4. Do an FFT.

5. Power Density Spectra will be |FFT|^2.

6. Smooth with a 31 point triangle window; this is also known as a Bartlett window.

7. Convert to equivalent Hz by multiplying by 1/(mean R-R interval). If this is not done the units are
cycles/beat.

8. Letting B = S, M, D, P, compute the cross spectra for each blood pressure variable with heart rate:

$C_{BI}(f) = X_B^{*}(f) \, X_I(f) = L(f) - jQ(f)$

L(f) is called the cospectrum and Q(f) is the quadrature spectrum. Remember that these have been
smoothed.

9. Compute the smoothed estimator of the phase spectrum for each variable:

$\phi_{BI}(f) = \arctan(-Q(f) / L(f))$

with $\phi_{BI}(f)$ over the range of ±180 degrees.
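
Steps 3 through 9 can be sketched with NumPy as follows. The sketch assumes the cross spectrum is written as the cospectrum minus j times the quadrature spectrum, so that the phase is the arctangent of -Q/L; the four-quadrant arctangent is used to obtain the full ±180 degree range. The function name and the synthetic test signals are hypothetical.

```python
import numpy as np


def cross_spectrum_phase(bp, rr, smooth_points=31):
    """Smoothed cospectrum L, quadrature spectrum Q, and phase (degrees)
    between a blood pressure series and an R-R interval series."""
    bp = np.asarray(bp, dtype=float) - np.mean(bp)    # subtract the d.c. component
    rr = np.asarray(rr, dtype=float) - np.mean(rr)
    cross = np.conj(np.fft.rfft(bp)) * np.fft.rfft(rr)
    window = np.bartlett(smooth_points)               # 31 point triangle window
    window /= window.sum()
    L = np.convolve(cross.real, window, mode="same")
    Q = np.convolve(-cross.imag, window, mode="same") # cross = L - jQ
    phase = np.degrees(np.arctan2(-Q, L))             # range of +/-180 degrees
    return L, Q, phase


# Synthetic check over 512 beats: pressure leads the interval series by 60 degrees.
n = np.arange(512)
rr = np.sin(2 * np.pi * 40 * n / 512)
bp = np.sin(2 * np.pi * 40 * n / 512 + np.pi / 3)
L, Q, phase = cross_spectrum_phase(bp, rr)
print(round(phase[40], 3))
```

In this convention a negative phase at a given frequency means the interval series lags the pressure series at that frequency.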

The Berger et al. method (5)

1. Determine R-R intervals.

2. Plot as 1/I_n for all n.

3. Pass through a boxcar (rectangle window) with the following parameters:

$W(f) = \left[ \sin(2\pi f / f_r) / (2\pi f / f_r) \right]^2$

where $f_r$ is the new sampling frequency and $2/f_r$ is the width of the window. The value for $f_r$ is usually
taken to be 4 Hz.

4. Compute the mean and remove the d.c. component.

5. Compute |FFT|^2.

6. Apply a 1/W(f) correction to the |FFT|^2.

7. Compute the square root of the values in number 6 to get amplitude.

8. Plot.

METHODS

Equipment

The data necessary to conduct the heart rate variability studies may be collected on a wide range of
ECG machines. It is essential that each machine be tested using a device similar to that recommended by the
AHA (12). The impedance of each electrode should also be evaluated to ensure a solid connection, thus
minimizing interference. Multiple types of continuous BP monitors are available. The Finapress (Ohmeda,
Englewood, CO) is one of the more widely used machines. A perusal of the literature and manufacturer
specifications must be done to ascertain any restrictions placed on the analysis by the internal signal
processing of the chosen BP machine.

All data gathered from the BP and ECG machines should be passed through an analog prefilter
prior to conversion to digital form. The prefilter stopband for each channel is based on the stopband of the
channel requiring the fastest acquisition speed. These filters may be commercial or custom built. If custom
built, a function generator and oscilloscope are necessary to evaluate their effectiveness.

The conversion of the analog signal to digital requires one of the many available A/D boards. Two
stipulations exist. First, it must be compatible with the chosen computer system. Second, its throughput must
be fast enough to transfer data from all of the signals being processed. As a minimum, it should have 8
channels, 12 bit resolution, and a throughput of 10,000 Hz. However, 16 channels, 12 bit resolution, and a
throughput of 100,000 Hz would allow for the expansion of the number of signals simultaneously evaluated.
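
A quick way to check whether a board meets the throughput stipulation is sketched below. It assumes a multiplexed board whose aggregate throughput is shared equally among the active channels; the function names are hypothetical.

```python
def required_throughput(n_channels, rate_per_channel_hz):
    """Aggregate samples per second the board must sustain."""
    return n_channels * rate_per_channel_hz

def max_rate_per_channel(board_throughput_hz, n_channels):
    """Highest per-channel sampling rate, assuming equal sharing."""
    return board_throughput_hz / n_channels
```

For example, under this assumption a 10,000 Hz board running all 8 channels leaves 1250 Hz per channel.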

Many computer solutions exist for the implementation of the software analysis, including
mainframe, DEC, IBM, and Macintosh. The AL/CFTF system was a Macintosh IIfx 40 MHz computer
running the LabVIEW instrument system. Software development can occur with packages such as
LabVIEW, LabWINDOWS, or Hyperception. These environments make programming easier. However,
experienced C programmers can use Think C, Borland C++, or Microsoft C/C++ version 7.0 to achieve the
programming objectives.

As an intermediate step, a Honeywell or Racal tape recorder can be used to store the analog
signals prior to processing.

1. Use a Macintosh computer, with at least 32 Mb of RAM, running LabVIEW.

2. For a 4 channel system the A/D board should have a throughput of at least 10,000 Hz.

3. A prefilter is required for each channel. These may be bought at considerable expense, or a lower quality
filter can be built without too much difficulty. These filters should have a frequency cutoff of 100
or 125 Hz and a stopband that is approximately -70 dB.

4. Reality check. Compute the characteristics of the filter using a function generator and an oscilloscope.
Compute the Nyquist rate (2 times the cutoff frequency), and 2 times the stopband frequency.
Two times the stopband frequency will be the sampling rate for each channel.

5. Send known analog signals (such as sine or cosine signals with high and low amplitudes and periodicities)
through filters for A/D conversion.

6. Reality check. Sample at different frequencies (64, 128, 256, 512, 1024 Hz) and compute errors of
magnitude and phase.

7. Send known ECG signal through filters and A/D to the peak detection algorithm (8).

8. Reality check. Overplot peak detection with original signal. Check by visual inspection for the
alignment of the peak detection with the R peak.

9. Send a known signal simultaneously through 4 channels or however many will be used in the acquisition.

10. Reality check. Compute errors between channels due to sampling time selection. It has been referred
to as non-significant. However, to obtain the most accurate results possible, this error needs to be
quantified. Note, if the A/D board is designed specifically for simultaneous sample and hold this
step is not necessary.

11. Digitize the data from the ECG, BP, Thermistor, and other channels.

12. Store data in binary format as this is the most compact.

13. Determine ECG peaks and R-R intervals, verifying by visual inspection.

14. Determine systolic, diastolic, pulse, and mean values along with their intervals, verifying by visual
inspection.

15. Reality check. Send known signals through the blood pressure detection device and make sure
measurements are valid.

16. Compute instantaneous rates using the Berger et al. method (5), the equivalent of a low pass window
boxcar.

17. Make instantaneous plots of heart rate, SBP, DBP, PP, Mean Pressure.

18. Reality check. Also send a known signal through the boxcar filter and the steps that follow below.

19. Subtract D.C. component from data.

20. Develop two sets of data and pass one through a Hanning window.

21. Reality check. Look at side lobe leakage from the application of the Hanning window.

22. Compute |FFT|^2 for the data with and without the Hanning filter and get total power for each case.

23. Reality check. Compare the known signal power spectrum results with those that have been published
by Berger et al. (5).

24. Correct for Hanning window by multiplying by [total power without Hanning / total power with Hanning].

25. Correct for boxcar by multiplying by 1/W(f).

26. Plot PSD from 0 to 0.4 Hz. This is not normalized.

27. Take square root. Plot amplitudes for second set of graphs. This is not normalized.

28. Normalize 26 and 27 for a second set of graphs.

29. Letting B = S, M, D, P, where S is the systolic pressure, M is the mean pressure, D is the diastolic
pressure, and P is the pulse pressure, then compute cross spectra for each blood pressure variable
with heart rate (7). I is the R-R interval measurement.

$C_{BI}(f) = X_B^*(f) X_I(f) = L(f) + iQ(f)$

L(f) is called the cospectrum and Q(f) is the quadrature spectrum.

30. Compute the squared coherence estimator (7).

$K^2(f) = \left[ L(f)^2 + Q(f)^2 \right] / \left[ C_{BB}(f) C_{II}(f) \right]$

31. Compute the estimator of the phase spectrum for each variable (7):

$F(f) = \arctan(-Q(f)/L(f))$

F(f) over the range of ±180 degrees

32. Make cross spectra, phase, and coherence plots of SBP, DBP, PP, and Mean Pressure with heart rate.

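
The Hanning power correction (steps 19 to 24) and the coherence and phase estimators (steps 30 and 31) can be sketched as follows. This is a minimal illustration under stated assumptions, not the study's LabVIEW code: the function names are hypothetical, a naive DFT replaces the FFT to keep the example self-contained, and the cross and auto spectra passed to the last two functions are assumed to have been smoothed already (without smoothing, the squared coherence is identically 1).

```python
import cmath
import math

def dft(x):
    """Naive discrete Fourier transform, O(N^2); adequate for short records."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]

def hanning(n):
    """Hanning window of length n (n >= 2)."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * t / (n - 1)) for t in range(n)]

def hanning_corrected_power(x):
    """Remove the d.c. component, window, compute |FFT|^2, and rescale so the
    total power equals that of the unwindowed data (step 24)."""
    m = sum(x) / len(x)
    x0 = [v - m for v in x]
    xw = [a * w for a, w in zip(x0, hanning(len(x0)))]
    raw = [abs(v) ** 2 for v in dft(x0)]
    win = [abs(v) ** 2 for v in dft(xw)]
    scale = sum(raw) / sum(win)
    return [p * scale for p in win]

def squared_coherence(c_bi, c_bb, c_ii):
    """K^2(f) = (L^2 + Q^2) / (C_BB * C_II), from smoothed spectra.
    Bins where either auto spectrum is zero must be excluded by the caller."""
    return [(c.real ** 2 + c.imag ** 2) / (pb * p_i)
            for c, pb, p_i in zip(c_bi, c_bb, c_ii)]

def phase_degrees(c_bi):
    """F(f) = arctan(-Q/L), returned over the range +/-180 degrees."""
    return [math.degrees(math.atan2(-c.imag, c.real)) for c in c_bi]
```

Rescaling by the ratio of total powers preserves the overall variance while keeping the reduced side lobe leakage of the windowed spectrum.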
SUGGESTIONS  FOR  IMPROVING  THE  ANALYSIS 

Virtual Instruments

The LabVIEW virtual instruments (VIs) can undergo substantial improvement in the future. The
algorithms for the autoregressive approach can be added to the series of VIs, most probably through the
linking of C code modules. Additionally, enhancing the memory allocation and the overall design of the VIs
should make them more efficient in the data analysis. With the addition of a digital signal processing board,
real time analysis may be a possible program extension. Algorithms for peak detection under noisy
conditions should be added. The R-R interval algorithms can be optimized to reduce the memory
requirements of the programs and enhance efficiency.

Filters

If biological variability studies are to be conducted on a continuous basis, it may be appropriate to
purchase commercially manufactured filters. Selecting a filter with less than 1% attenuation between 0 and
100 Hz, a cutoff of 125 Hz, and a stopband of less than 256 Hz will allow each channel to be sampled at less
than 512 Hz. This will reduce the storage requirements by 50% and reduce the computational burden by
50%.
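
The arithmetic behind the 50% figure can be illustrated with a short sketch. It assumes the comparison is against sampling at 1024 Hz and that each 12 bit sample is stored in a 2 byte word; both are assumptions made for illustration, and the function names are hypothetical.

```python
def sampling_rate_from_stopband(stopband_hz):
    """Sampling rate taken as two times the stopband frequency."""
    return 2 * stopband_hz

def storage_bytes(n_channels, rate_hz, seconds, bytes_per_sample=2):
    """Raw binary storage for a recording (no header or compression)."""
    return n_channels * rate_hz * seconds * bytes_per_sample
```

Halving the sampling rate halves the number of samples, and therefore the storage, for the same recording length.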

Memory

When doing the variability analysis, lack of memory is crippling in the LabVIEW environment. The
programs ran more comfortably (i.e. larger data files could be manipulated) with 20 Mb of RAM.
However, the programs still had problems with very large data files. Therefore, 32 or 64 Mb of high speed
RAM would be more appropriate, and would compensate for inelegant programming.

LabVIEW

The LabVIEW programming environment is very efficient for rapidly developing applications.
However, it has certain annoying problems that the manufacturer may consider rectifying. They are:

1.  LabVIEW  needs  on-line  help  in  addition  to  the  wire  paths 

2. A 1-2 page form with pictures of the icons, their names, and what page in the manual to find them
would be appropriate for novice system users.

3. Icon rotation. Due to the rigid wiring structure, the wires can become very messy.

4. A function to zoom in or out, to find the current position in a large diagram (S. Stranges suggestion)

5. Easy development of a multiple windowing system like Microsoft Windows, the DEC
Workstations, or even the Macintosh when it is not running LabVIEW.

Abstract

Most studies of human performance and workload have been limited to single or dual task
paradigms. Recent advances in desktop computers have made possible the development of multiple
task batteries, which are more applicable to many real-world tasks. One such battery, the Multi-
Attribute Task Battery (MATB), was investigated. Psychophysiological measures combined with
performance data may provide useful information about workload, especially when used with
multiple task batteries. The Psychophysiological Assessment Test System (PATS) can
simultaneously present tasks and collect numerous channels of psychophysiological data. A
demonstration study was designed to evaluate the feasibility of using the MATB and the PATS
together to explore performance and workload. Two subjects performed different combinations of
tasks from the MATB while EEG and heart rate data were collected by the PATS. Subjective
ratings of workload and performance measures were also recorded. Both the MATB and the
PATS were found to be powerful, user-friendly systems that should lead to new avenues of
research in human performance and workload.

MULTIPLE PHYSIOLOGICAL MEASURES OF PERFORMANCE
ON A MULTIPLE TASK BATTERY

Scott  H.  Mills 


Human performance on standard laboratory tasks has long been used as a vehicle for
understanding many aspects of workload. However, most studies have been limited to the use of
single or dual task paradigms. While these research settings do allow for the investigation of many
well-defined aspects of human performance, the findings can be difficult to apply to tasks
performed routinely in the workplace. Most of these tasks involve several different processes,
instead of just one or two. For example, driving a car requires simultaneously performing a large
number of sub-tasks. Another difficulty in applying laboratory tasks arises because they are
usually relatively short in duration (usually less than five minutes). This presents a problem with
comparisons to real-world tasks that are often much longer in duration.

One solution to the problem of short, simple laboratory task paradigms is the use of
multiple task batteries. Improvements in the capabilities of desktop computers over the last several
years have made the development of these batteries more feasible. One such battery recently
developed at NASA is the Multi-Attribute Task Battery (MATB; Comstock & Arnegard, 1992).
This battery is capable of presenting multiple tasks on the same computer screen. With the use of
programming scripts, the experimenter can vary the combination of tasks, their difficulty, and their
duration. The MATB includes a system monitoring task, a tracking task, an audio
communications task, and a resource management task (See Figure 1). The flexibility of this
battery makes possible research on both single and multiple task performance within both short
and long duration test periods. Short task periods allow greater control in the exploration of task
effects (such as type of task and workload level) on psychophysiology and performance, while
longer tasks allow greater ability to explore the effects of stressors and adaptive responses to
workload.

Another feature of the MATB is its inclusion of a subjective workload rating instrument,
the NASA Task Load Index (TLX; Hart & Staveland, 1988). The TLX is a multiple subscale
measure that can be presented at any time during performance of the MATB. Thus, it can be
easily used as a complement to performance data in the assessment of workload (See Figure 2).

Figure 1. The MATB screen. [Image not reproduced; visible panel titles: SYSTEM MONITORING, TRACKING, SCHEDULING.]

Figure 2. The NASA-TLX workload rating screen. [Image not reproduced.]


Workload assessment usually takes the form of one of the following categories: task
performance, subjective measures, or psychophysiological indices. Of these, task performance is
most often cited in the literature. One disadvantage of looking only at task performance is that
actual effort or workload may not always be reliably estimated. This is because subjects may be
recruiting different amounts of resources in order to attain similar performance; that is, some
subjects may have to work harder to achieve the same performance as others. An additional
attempt to quantify differences in workload is the use of subjective rating scales. Subjective data,
however, may be subject to bias and expectations. They can also be compromised by problems
associated with the delay between actual task performance and responding to the ratings.

Psychophysiological methods of assessment have several advantages over performance and
subjective measures. These measures can be made continuously, and therefore reveal changes in
patterns of psychophysiological activity over time. Another advantage is that the collection of
these measures does not intrude on the subject's primary task.

The management of such varied sources of task, subject, and operational environment data
does pose a significant problem. One important advance in addressing this problem has been the
development of the Psychophysiological Assessment Test System (PATS), which has been
developed by the psychophysiological laboratory of the Armstrong Aerospace Medical Research
Laboratory at Wright-Patterson AFB. The PATS was designed to be a comprehensive test system
for the presentation of tasks and for the measurement of psychophysiological data. It can record
and analyze numerous channels of such data. It also has the capability of communicating with
other computer systems such as those generating tasks, and other data output devices (such as
digitized or analog signals for sensors or controllers of various sorts). These features make the
PATS an ideal system for use with multiple task battery research.

The combination of the MATB and the PATS provides a flexible laboratory system for
investigating human performance and workload under a wide variety of experimental designs. The
MATB can present multiple combinations of different types of tasks for both short-interval and
long-interval durations. The PATS can record up to sixteen channels of physiological data and
provide several levels of analysis. Both these systems are designed to optimize usability by the
researcher. The PATS system has a user interface based on HyperCard and requires no
programming. The MATB includes an easy-to-use setup screen and requires only the writing of
script files consisting of times and one or two-word event codes. This ease of use makes the
combination of these two systems a very practical, useful research environment that may provide
new avenues to investigate complex areas of human performance and workload.

The purpose of this study was to evaluate the feasibility of using the MATB and the PATS
together for conducting research in human performance. An experiment was designed to present
subjects with different combinations of tasks and workload levels on the MATB and to record
psychophysiological responses with the PATS for comparison to performance and subjective data.

Method 

Subjects 

Two male subjects, ages 25 and 21, participated in this study.

Apparatus 

The Multi-Attribute Task Battery (MATB) provides the subject with a complex
combination of tasks. The battery presents the tasks running in individual windows on a standard
PC screen and comprises the four tasks described below.

System Monitoring task. This task is presented in the upper left portion of the screen, where two
lights and four dials are presented. The subject's job is to monitor these objects for changes from
their "normal" condition. When a change is detected, the subject must click with the mouse on the
aberrant object. In the normal condition the left light is on and is green. A change is defined as
when this light turns off. The right light is normally off, and a change in it is defined as when it turns
red. The pointers in each of the dials normally fluctuate around the center mark, staying within one
mark below and one mark above. A change in a dial is defined as when the pointer begins
fluctuating beyond one mark on either side of the center.

Tracking task. This task is presented in the upper center of the screen. The subject controls the
target (the circle) with a joystick and must attempt to keep the target in the center of the window.
This task has three difficulty levels. Tracking can also be switched to an automatic mode, in which
the target stays near the center of the window without any input from the subject. During this
mode the word "AUTO" appears at the bottom of the window.

Resource Management task. This task appears in the window at the bottom center of the screen.
A diagram representing fuel tanks and pumps connecting them is presented. The objective of this
task is to maintain the level of fuel in the upper two tanks at 2500 gallons. These tanks are drained
of fuel at a constant rate; therefore the subject must turn on and off pumps to control the transfer
of fuel from the lower tanks. Pumps that are turned on are green in color, and pumps that are off
are clear. The subject can toggle pumps by using the mouse to click on them. The levels of fuel in
four of the tanks are indicated by the number below each tank. The other two are supply tanks and
have unlimited capacity. The flow rate for each pump is indicated in the "Pump Status" window in
the lower right portion of the screen. When a pump is off, this rate is displayed as zero.

Any fuel pump can be made to fail by the program, at the discretion of the experimenter.
When this occurs, the pump turns red and stops transferring fuel. The subject has no control over
these failures and cannot control the pump during a failure. Pumps can also be repaired by the
program. When this happens, the pump is returned to the off condition; therefore the subject must
actively turn it back on if desired.

Communications task. The communication task is presented in the window in the lower left
portion of the screen. This window displays the subject's call sign and the settings for four aircraft
radios. The subject must listen through headphones for messages directed to his or her call sign and
follow instructions to change one of the radio frequency settings to a specific new one. This task
was not used in the present study.

Scheduling Window. This window at the top right of the screen allows subjects to anticipate when
the performance of either the tracking or communication task will be required. It was not used in
the current study.

Rating Scale Screen. This screen (See Figure 2) displays the NASA Task Load Index (TLX). It
can be programmed to appear at any time during performance of the battery. While the TLX is
presented, the battery is frozen and resumes when the subject exits the TLX screen or when the
maximum time for completing the ratings (one minute) has been exceeded. The TLX measures
subjective workload by requiring the subject to rate their experience of workload on six subscales:
Mental Demand, Physical Demand, Temporal Demand, Performance, Effort, and Frustration. The
subject uses the mouse to select a rating value for each subscale. The analysis of these selections
is explained in the Results section.

The special nature of this research requires a powerful and flexible system for the
collection and management of psychophysiological data. The Psychophysiological Assessment
Test System (PATS) uses a Macintosh IIx computer equipped with commercially available data
acquisition boards and a high-capacity data storage device. The PATS has the ability to
simultaneously present tasks and collect both performance and psychophysiological data for
periods of up to several hours. It can digitize up to 16 channels of analog data at a 1000 Hz rate
for each channel, and also provides several analysis tools such as digital filtering and spectral
analysis. The PATS can be set up to present tasks and collect data in a wide variety of combinations
through the use of menu-driven screens. For the present study, only the psychophysiological
collection and analysis components of the PATS were used.

Task  Conditions 

Eight task conditions were developed, using various combinations of the Monitoring,
Tracking, and Resource Management tasks (See Figure 3). Three of these conditions consisted of
one of the tasks being performed by itself. One condition was a baseline, in which no tasks were
performed, but the subject viewed the MATB screen and sat quietly. Three conditions were
designed to require low, medium, and high workload demands of the subject. Finally, one
condition was designed to demand an extremely high level of workload on all three tasks for four
minutes (overload), and then to require only a moderate level for four minutes (medium).


Task Condition Summary

                                     Task Status
Condition         System Monitoring  Tracking    Res. Management     Duration
Baseline          automatic          automatic   automatic           4 minutes
Monitoring        2 Stim/min         automatic   automatic           4 minutes
Tracking          automatic          Low         automatic           4 minutes
Res. Management   automatic          automatic   On-No Failures      4 minutes
Low               2 Stim/min         automatic   On-No Failures      4 minutes
Medium            2 Stim/min         Low         On-No Failures      4 minutes
High              4 Stim/min         Medium      1.25 Failures/min   4 minutes
Overload/         14 Stim/min        High        8 Failures/min      8 minutes
Medium            2 Stim/min         Low         On-No Failures

Figure 3. Task Conditions.


Procedure 

This study consisted of two phases, a training phase and a test phase. Each phase lasted
approximately two hours, and each phase occurred on a different day. Subjects were trained and
tested individually. For the training session, the subject was provided with a demonstration of the
Monitoring, Tracking, and Resource Management tasks, as well as the TLX rating screen. The
subject was encouraged to ask questions and to practice each procedure. Then each subject
practiced the MATB for a total of 72 minutes. This practice session comprised a combination of
the (non-baseline) 4-minute conditions described above. The overload condition was not included.
Each four-minute trial was followed by the TLX rating screen.

For the testing phase, each subject was instrumented to permit recording of several
standard measures used in human performance research in workload assessment. These were heart
rate, eyeblink, and four channels of EEG data. The subject was seated in an electrically shielded,
acoustically insulated chamber, equipped with mouse and joystick controls and a window for
screen viewing. Each of the four-minute task conditions of the MATB was presented in a random
order, with the exception of the baseline condition, which was always presented as the fifth trial.
After each four-minute condition, the subject was allowed three minutes to return to baseline.
During this break period the door to the experimental chamber was opened, and the experimenter
told the subject which of the tasks he would be performing in the next presentation. The subject
was not told the difficulty of the upcoming tasks, only which ones would be required. Following
the last four-minute condition, the eight-minute "overload-medium" condition was presented.
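
One way to generate a presentation order with these constraints is sketched below. The condition labels follow the task condition summary, while the function name and the use of a seeded generator are illustrative assumptions rather than a description of how the order was produced in the study.

```python
import random

# The six non-baseline four-minute conditions.
FOUR_MINUTE = ["Monitoring", "Tracking", "Resource Management",
               "Low", "Medium", "High"]

def trial_order(rng):
    """Shuffle the four-minute conditions, place Baseline as the fifth
    trial, and append the eight-minute Overload/Medium condition last."""
    order = FOUR_MINUTE[:]
    rng.shuffle(order)
    order.insert(4, "Baseline")        # index 4 is the fifth trial
    order.append("Overload/Medium")
    return order

order = trial_order(random.Random(1992))
```

Passing in a seeded `random.Random` makes a given subject's order reproducible.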
Subjective Ratings

The NASA Task Load Index was presented immediately following completion of each task
condition (during the break period, for no more than one minute) on the same screen. Following
the eight-minute condition, the subject was asked to complete the TLX for the "first half" of the
task, the overload portion. After completing the last TLX rating, the subject exited the chamber
and performed the Aftermat program. Aftermat is a component of the TLX that requires the
subject to make factor comparisons of each pair of subscale titles. Its results are used in the
computation of the TLX weighted workload scores.
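
The weighted workload score can be illustrated with the standard NASA-TLX weighting procedure (Hart & Staveland, 1988): each subscale's weight is the number of times it is chosen across the 15 pairwise comparisons, and the score is the weight-averaged rating. The sketch below assumes this standard procedure; the function names and data layout are hypothetical and do not describe the Aftermat program's internals.

```python
from itertools import combinations

SUBSCALES = ["Mental Demand", "Physical Demand", "Temporal Demand",
             "Performance", "Effort", "Frustration"]

def tally_weights(choices):
    """choices maps each frozenset pair of subscales to the one selected.
    Returns how many times each subscale was selected (0 to 5)."""
    weights = {s: 0 for s in SUBSCALES}
    for pair in combinations(SUBSCALES, 2):
        weights[choices[frozenset(pair)]] += 1
    return weights

def weighted_workload(ratings, weights):
    """Sum of rating x weight over the subscales, divided by the total
    weight (15 when all pairwise comparisons are completed)."""
    total = sum(weights.values())
    return sum(ratings[s] * weights[s] for s in SUBSCALES) / total
```

A subscale that is never chosen in the comparisons receives a weight of zero and so does not contribute to the score.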


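[Figure: timeline of the test session. Successive 4-minute task conditions (including the baseline) are each
followed by a TLX rating, ending with the 4-minute overload and 4-minute medium segments. Time runs
from left to right.]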


Results 

This  study  was  designed  as  a  pilot  project  to  determine  the  feasibility  of  using  the  MATB 
and  the  PATS  systems  together.  Because  of  the  exploratory  nature  of  the  design  and  the  low 
number  of  subjects,  only  descriptive  statistics  will  be  reported. 

The following data were collected by the MATB: for Monitoring, response time for
correct responses, number of false alarms, and number of missed stimuli; for Tracking, Root Mean
Square error (RMS); and for Resource Management, mean deviation. This deviation was
computed by the following method: Each time the status of a fuel pump changed, the absolute
deviations from the criterion fuel level for both upper tanks were measured and added together.
The mean of these numbers, over a four-minute trial, is reported as the mean deviation. These
MATB performance data are presented in Figure 4. Several characteristics can be noted: First,
performance for all single-task conditions was superior to that for any dual or multi-task
conditions. In contrast, task measures did not necessarily degrade linearly with the hypothesized
increases in workload. Performance for the final medium condition was expected to return to
approximately the level for the first medium condition. However, both Tracking and Resource
Management appear to be anomalous. Tracking performance for both subjects was better than
expected, and Resource Management performance appears to be worse. It should be noted,
however, that the mean deviation reported for Resource Management probably reflects degraded
performance from the Overload condition. This is because when a subject changes his or her
performance, it is not reflected in the measure of fuel deviation until the tanks "catch up" with that
change.
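
The two computed performance measures can be sketched as follows. The 2500 gallon criterion comes from the task description; the function names, and the assumption that tracking error is supplied as a list of distances from the center of the window, are illustrative only.

```python
import math

def rms_error(distances):
    """Root mean square of the target's distances from the window center."""
    return math.sqrt(sum(d * d for d in distances) / len(distances))

def mean_deviation(tank_levels, criterion=2500):
    """tank_levels holds one (upper tank 1, upper tank 2) pair for each
    change in pump status during a trial. For each pair, the absolute
    deviations from the criterion are added; the mean of these sums is
    returned."""
    sums = [abs(a - criterion) + abs(b - criterion) for a, b in tank_levels]
    return sum(sums) / len(sums)
```

Because levels are sampled only when a pump changes state, a trial with few pump changes contributes few samples to the mean.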
As previously mentioned, the psychophysiological measures were heart rate, eyeblink, and
four channels of EEG data. The EEG data were recorded at locations Oz, Pz, and Fz (See 10-
20 Electrode Placement System, Jasper, 1958). Spectral analysis was performed and mean power
was calculated for each of the following bandwidths: Delta (up to 3 Hz), Theta (4-7 Hz), Alpha
(8-13 Hz), and Beta (14-25 Hz). Ordinarily, an experimenter can use the PATS to subtract eyeblink
signals from those of the EEG. Because of a configuration error, eyeblink data for this study were
not usable. Therefore, they are not reported, and the reported EEG data include some eyeblink
artifact. For heart rate, the mean interbeat interval (IBI) for each task condition is reported. These
data are presented in Figures 5 and 6. The eyeblink artifact makes interpretation of EEG data
difficult. Heart rate, however, did appear to correspond to some differences in task condition. For
both subjects, heart rate was relatively slow during the baseline condition and relatively fast during
overload.
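
Mean power per band and the conversion from interbeat interval to heart rate can be sketched as below. The band edges are those listed above; excluding the 0 Hz bin from Delta, treating the IBI as milliseconds, and the function names are all assumptions made for illustration, not details of the PATS analysis tools.

```python
# Band edges in Hz, inclusive, as listed in the text.
BANDS = {"Delta": (0.0, 3.0), "Theta": (4.0, 7.0),
         "Alpha": (8.0, 13.0), "Beta": (14.0, 25.0)}

def mean_band_power(freqs, power):
    """Average the power values whose frequency falls inside each band.
    The d.c. bin (0 Hz) is skipped; empty bands return None."""
    result = {}
    for name, (lo, hi) in BANDS.items():
        vals = [p for f, p in zip(freqs, power) if f > 0 and lo <= f <= hi]
        result[name] = sum(vals) / len(vals) if vals else None
    return result

def heart_rate_bpm(mean_ibi_ms):
    """Convert a mean interbeat interval in milliseconds to beats/minute."""
    return 60000.0 / mean_ibi_ms
```

A shorter mean IBI corresponds to a faster heart rate, so the overload condition would show the smallest IBI values.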

The NASA TLX program produces a weighted workload score for each rating performed.
This score is computed from weighted subscale ratings. These scores are presented in Figure 7.
These data are interesting because, for certain conditions, they parallel some of the
psychophysiological data. For example, for the low, medium (first), and high conditions, Subject 1
rated medium as having the least amount of workload and had the lowest heart rate of those three
conditions. Subject 2 rated these conditions in a more linear manner, and his heart rate followed
the same trend (with heart rate increasing as reported workload increased).
Figure 4. MATB Performance Means. [Table values not recoverable; rows are performance variables for each
subject, columns are the task conditions Tracking, Res Mngmnt, Monitoring, Low, Medium, High, Overload,
and Medium*.]

Figure 5. Psychophysiological Measures for Subject 1.

Figure 6. Psychophysiological Measures for Subject 2.

Figure 7. NASA TLX weighted workload scores.


Discussion 


This study included only two subjects because of its purpose: to examine the feasibility
of using the PATS and the MATB concurrently in the investigation of human performance and
workload. This small sample size makes the formation of any conclusions about the observed data
difficult. Regardless, some general trends in the data are encouraging. It is also possible that any
meaningful patterns in the EEG data may have been obscured by interference from eyeblink
signals.

Nonetheless, the primary purpose of the study was accomplished. Both the PATS and the
MATB performed very successfully and their combination provided a robust, reliable research
environment. The MATB was easily configured for a variety of task combinations. It presented
tasks and recorded both performance and subjective workload rating data reliably. The PATS also
proved to be flexible and easy to use, for both psychophysiological data collection and analysis.
The user interface is based on HyperCard and uses a hierarchy of menus. This interface is very
user-friendly and provides extensive error-checking of user input. The PATS also manages and
analyzes large amounts of data with a maximum of ease for the user. These features make the
PATS a powerful and versatile laboratory system with the potential of facilitating a wide variety of
psychophysiological research.

Multiple task battery research combined with the analysis of multiple psychophysiological
measures may lead to better understanding of many facets of human performance and workload.
The PATS and MATB systems appear to be valuable tools when used both individually and in
combination, and should provide many new avenues of productive research.

A NONRADIOACTIVE ASSAY FOR THE DETECTION
OF  AP-1  COMPLEXES  IN  THE  SCN  OF  HAMSTERS 


Heather  Panek 

Graduate  Teaching  Assistant 
Department of Biology
University  of  Scranton 

Abstract 

The  primary  pacemaker  generating  circadian  rhythms  is 
located  in  the  suprachiasmatic  nuclei  (SCN)  of  the  hypothalamus. 
Light  and  the  circadian  pacemaker  interact  to  regulate  a  specific 
set  of  immediate  early  genes  in  the  SCN  that  may  participate  in 
the  entrainment  of  the  circadian  clock.  Specifically,  c-fos 
related  proteins  interact  with  a  member  of  the  Jun  family  to  form 
the  transcription  factor  complex  AP-1  that  can  bind  to  a 
particular  DNA  sequence  (TGACTCA).  This  DNA-protein  complex 
interaction can be isolated in a gel mobility shift assay. To
detect the AP-1 complex, a nonradioactive system, based on
previously described radioactively labeled DNA probe protocols,
has been investigated. DNA probes labeled nonisotopically with
fluorescein-11-dUTP can be detected with antibody-enzyme
conjugates and enzyme substrates. This technique is desirable
for several reasons, including short exposure times, long storage
of labeled probes, and the added safety of nonisotopic systems.


Introduction 

The SCN of the hypothalamus in mammals is the site of a
circadian pacemaking system and is entrainable by light during
subjective night (1). However, the specific mechanisms that set
the rhythm-keeping clock are unknown. It has been noted that, in
response to light at appropriate circadian times, phase shifts
and a corresponding increase in the mRNA of c-fos and jun-B
occur. This increase is only observed in response to light
stimulation at times that also cause phase shifts. The resulting
Fos proteins can interact with Jun proteins to form a
transcription factor AP-1 complex (2). This AP-1 complex
recognizes a specific DNA binding site, TGACTCA (3). The protein
is able to bind to DNA outside of the helix without disturbing
base pairing. These AP-1 complexes can be detected, after light
stimulation during subjective night, by a gel mobility shift
assay (2). Use of this assay would be helpful to determine if
there are AP-1 complex increases in response to stimuli other
than light, such as certain drugs that alone can cause circadian
phase shifting. If AP-1 is seen in response to other stimuli, it
will be further evidence that c-fos and jun-B are part of the
expression cascade in the generation of circadian rhythms.
However, present mobility shift assays have some disadvantages
since they are based on radioactive detection systems (2) (4).
There are nonisotopic labeling and detection systems available to
replace radioactive methods, but they have not been applied
specifically to mobility shift assays. Therefore, an effort was
made to adapt these nonisotopic kits to previously described
mobility shift protocols and to determine the conditions
necessary to detect AP-1 complex formation.

Gel mobility shift assays are based on the fact that the
mobility of DNA complexed to a protein is greatly reduced in a
polyacrylamide gel (5). Because of this, unbound DNA and
proteins will run, in a low percentage gel, well in front of any
complexes, making the complex easily detectable.

To perform a mobility shift, DNA in short oligonucleotide
form, containing the binding protein consensus site, is end
labeled with terminal deoxynucleotidyl transferase, an enzyme
that will repeatedly add nucleotides to the 3'-OH termini of
oligonucleotides (6). Then the labeled oligonucleotide is
incubated with the DNA binding protein and binding buffers (4).
The entire assay is run on a nondenaturing polyacrylamide gel and
labeled DNA-protein complexes are then detected.

The first nonisotopic system tested was Amersham's ECL
(enhanced chemiluminescence) 3' oligolabeling and detection
system. It uses fluorescein-11-dUTP and terminal
deoxynucleotidyl transferase to label the 3' end of standard
oligonucleotides. A single labeling reaction labels from 25 to
100 picomoles of oligonucleotide in 60 to 90 minutes, adding
approximately 6-8 nucleotides (7). Labeled probes can be stored
in the dark at -20 °C for up to five months. The labeled
oligonucleotide is incubated with AP-1 containing SCN extract and
separated on a 4% polyacrylamide gel. After electrophoresis the
complexes must be transferred to a solid support. The complex is
electroblotted onto a positively charged nylon membrane
(Boehringer Mannheim) for one hour. The DNA of the complex is
fixed to the nylon membrane by ultraviolet radiation that
crosslinks thymine residues in the DNA to positively charged
amino groups on the surface of the membrane (9). After fixation,
the membrane is blocked to prevent non-specific antibody-enzyme
conjugate binding, a major source of high background. Then the
membrane is incubated with anti-fluorescein horseradish
peroxidase conjugate that will bind to the fluorescein-labeled
complexes. Detection occurs when the horseradish peroxidase
catalyzes the oxidation of the detection substrate, luminol.
Immediately following oxidation, luminol is in an excited state
which can decay to the ground state by a light-emitting pathway.
Amersham's detection kit also contains an enhancer that magnifies
the light output 1000-fold. The emitted light, which has a
maximum emission at 428 nm, can be detected on blue-light-
sensitive film. The light emission peaks after 1-5 minutes
before decaying over 2-3 hours (10). Sensitivity for these
probes is 25 attomoles, compared to 1-5 attomoles for ³²P.

The second system tested was Boehringer Mannheim's DIG
Oligonucleotide 3'-End Labeling Kit (Genius 5). The overall
protocol for Genius 5 is very similar to the ECL system even
though they utilize different labeling and detection reagents.
In the Genius 5 system, digoxigenin (DIG), a steroid hapten, is
linked via a spacer arm to a nucleotide. Oligonucleotides are
labeled by the incorporation of a single DIG-labeled nucleotide,
DIG-ddUTP (11). After the binding assay and electrophoresis, the
DIG-labeled oligonucleotide probes are detected by an immunoassay
utilizing antidigoxigenin-alkaline phosphatase conjugate. To
visualize the complexes of AP-1 and oligonucleotide, Lumigen,
a component of Lumi-Phos 530, is dephosphorylated by alkaline
phosphatase, forming an unstable intermediate. This intermediate
decays and emits blue light in direct proportion to the amount of
alkaline phosphatase present. Lumi-Phos also contains a
fluorescein surfactant that emits bright yellow luminescence at
525 nm, greatly enhancing the detected signal. Lumi-Phos 530
allows multiple exposures of the blot since the light intensity
increases for 7-8 hours, is maximal for 12 hours, and slowly
decreases over several days (12). The Genius 5 system is reputed
to have sensitivity equal to ³²P.

Methods  and  Materials 

Paired SCN were removed from Syrian hamsters after two hours of
light stimulation, beginning at CT 19. The tissue from two
hamsters was pooled into one sample, frozen on dry ice, and
stored at -80 °C until sonication as described by Kornhauser et
al. (2). Soluble proteins were obtained after 25 minutes of
microcentrifugation at 10,000 g and 4 °C. AP-1 consensus site
oligonucleotides were purchased from Promega. The
oligonucleotides are double stranded, 21 nucleotides long, and
contain a single consensus site (5'-CGCTTGATGACTCAGCCAA-3').
Oligonucleotides were end labeled with fluorescein-11-dUTP and
terminal transferase according to Amersham's 3' oligolabeling and
detection system. Labeling efficiency was checked by the
manufacturer's rapid labeling assay. The labeled oligonucleotides
do not have to be purified and are stable in a dark environment at
-20 °C for up to five months. Mobility shift binding assays were
based on previously reported protocols (2) (4). Ten microliters
of tissue extract, 4 µl of 5x binding buffer [20% glycerol, 5 mM
MgCl2, 2.5 mM EDTA, 2.5 mM DTT, 250 mM NaCl, 50 mM Tris-Cl (pH
7.5), 100 µg/ml denatured salmon sperm DNA], and dH2O to 20 µl
(including labeled oligonucleotide) were incubated on ice for 15
minutes. Approximately 22 ng of oligonucleotide was added and the
reaction continued at room temperature for 45 minutes. Reactions
were stopped with the addition of 10x loading buffer [40%
glycerol, 250 mM Tris-Cl (pH 7.5), 0.2% bromophenol blue]. The
mobility shift was performed on a pre-run 4% polyacrylamide gel
at 150 volts for approximately one hour in 25 mM Tris, 192 mM
glycine buffer. The samples were then electroblotted onto a
positively charged nylon membrane (Boehringer Mannheim) for one
hour at 90 volts in 0.5x TBE buffer. The DNA complexes were
fixed to the membrane by ultraviolet crosslinking for three
minutes. The membrane was blocked with Amersham's blocking agent
for 30 minutes, incubated with anti-fluorescein horseradish
peroxidase conjugate, and washed as suggested in the ECL detection
system protocol. The blot was exposed to ECL Hyperfilm
(Amersham) for three hours or overnight and developed.
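
The final concentrations in the binding reaction follow directly from the volumes given above (4 µl of 5x buffer in a 20 µl reaction). The sketch below works through that arithmetic and also converts the 22 ng of probe to picomoles. The conversion assumes an average mass of 660 daltons per base pair, a common rule of thumb that is not stated in this report, and the function names are illustrative.

```python
# 5x binding buffer composition as listed in the methods.
BUFFER_5X = {
    "glycerol (%)": 20.0,
    "MgCl2 (mM)": 5.0,
    "EDTA (mM)": 2.5,
    "DTT (mM)": 2.5,
    "NaCl (mM)": 250.0,
    "Tris-Cl (mM)": 50.0,
    "salmon sperm DNA (ug/ml)": 100.0,
}


def final_concentrations(stock, stock_volume_ul, total_volume_ul):
    """Scale each stock concentration by the dilution factor."""
    factor = stock_volume_ul / total_volume_ul
    return {name: value * factor for name, value in stock.items()}


def ds_oligo_pmol(mass_ng, length_bp, da_per_bp=660.0):
    """Convert a mass of double-stranded oligonucleotide to picomoles.

    da_per_bp is an assumed average mass per base pair (a rule of thumb),
    not a value taken from the report.
    """
    return mass_ng * 1000.0 / (length_bp * da_per_bp)


# 4 ul of 5x buffer in a 20 ul reaction gives the 1x working concentrations.
in_reaction = final_concentrations(BUFFER_5X, stock_volume_ul=4.0, total_volume_ul=20.0)

# 22 ng of a 21 base pair duplex is roughly 1.6 pmol under the assumed mass.
probe_pmol = ds_oligo_pmol(22.0, 21)
```

Under these assumptions the reaction contains 4% glycerol, 1 mM MgCl2, 0.5 mM EDTA, 0.5 mM DTT, 50 mM NaCl, 10 mM Tris-Cl, and 20 µg/ml salmon sperm DNA.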

The Genius 5 system was used according to the manufacturer's
instructions, which were similar to the ECL system's. A major
difference with the Genius detection kit was the exposure time
needed. The Lumi-Phos 530 gave off a much stronger signal for a
much longer time. This cut the exposure time to minutes instead
of hours, and multiple exposures were possible.

Results  and  Discussion 

There are several adjustments that must be made to the gel
mobility shift assay before the switch to a nonisotopic system
can occur. Both Amersham's and Boehringer Mannheim's systems
require larger concentrations of oligonucleotide in the labeling
reaction than radioactive protocols. In some cases, purchased
oligonucleotides did not come in great enough quantity to perform
labeling reactions of optimal concentration, even though there is
more than enough for radioactive methods. Another change for
both kits was that the time needed for the binding assay was
tripled compared to the traditional protocol (4).

The ECL system required much more fluorescein-labeled probe
for the binding assay than either radioactive methods or the ECL
literature predicted. This observation may stem from the fact
that the Amersham labeled probe was not purified (per
instructions). If the labeling reaction was not as efficient as
expected, much unlabeled oligonucleotide could be included in the
22 ng of probe used in the binding assay. Another possibility is
that there is a loss of labeled oligonucleotide in the
electroblot transfer and the antibody washing steps, which would
result in decreased signal output. Detection times for Amersham
were approximately two to three hours, or overnight if desired,
although no increase in signal is seen with the longer exposure.
Using plenty of probe resulted in a fairly strong signal and very
little background. Unfortunately, the ECL detection system
reaches a maximum very quickly (1-5 minutes) and decays rapidly,
so multiple or long exposures for weaker signals are impossible.

The Genius 5 system, which is technically almost identical
to Amersham's, has a major advantage in that its detection signal
has a much longer time course. This makes multiple exposures
with widely varying times possible. The Genius system did not
require as much labeled probe to be added to the binding assays.
Approximately 8 ng were used. This is probably because the
Genius instructions recommend purification of the labeled probe
and include a method to estimate labeled probe yield. Being able
to determine labeled probe concentration is also an advantage
over Amersham's system. High background has been a problem with
the Genius systems, but it has been reduced with new blocking
agents and increased suggested blocking times.

After finding the best way to apply these systems to
mobility shifts, there still remains the problem that no AP-1 was
detected. Since this complex has been previously seen in
response to light stimuli (2), there must be some technical
reason AP-1 does not appear in the mobility assay. Either the
protein is not being extracted from the SCN tissue or it is being
degraded. Experimenting with nuclear extracts of the SCN tissue
and more rigorous precautions against peptidases should produce
AP-1 protein. Also, purified AP-1 should be used in a binding
assay to ensure that the systems are capable of binding and
detecting the complex.

If the problem of no AP-1 detection continues with the
mobility shift assay, perhaps a slightly different approach is
necessary. A Southwestern hybridization (8) can be used. This
technique is performed by running only the protein extract on a
polyacrylamide gel and transferring the proteins to a solid
support (western). A Southern hybridization with labeled
consensus oligonucleotide could then be used to detect AP-1 protein.

In conclusion, with a little more work, the nonradioactive
systems can be viable alternatives to the more dangerous and
ultimately more expensive ³²P-based mobility shift assays. The
nonisotopic systems are faster and the five-month probe storage
is a real advantage. Once the nonisotopic systems have been
completely adapted for the gel mobility shift, there will be no
reason to return to radioactive systems.

GROWTH OF MICROBUBBLES AT ALTITUDE


Joseph Pellettiere
Graduate Student
Case Western Reserve University

ABSTRACT 

The exact cause of decompression sickness (DCS) is not known. Several theories try to
predict the risk of DCS through bubble growth models. These models describe the mechanics of
bubble growth based on diffusion of gas from a tissue into a bubble due to a change in pressure.
To assess the validity of these theoretical bubble growth equations, an in vitro experiment was
performed to simulate a subject going to altitude. The experiment was set up to eliminate
unknowns in the growth equations. A model altitude chamber containing water was subjected to
various pressure differentials and the resulting microbubbles were observed through a
microscope connected to a video camera. A computer-based image analysis system was used to
determine the growth rate from the video recordings. Three pressures (500 mmHg, 380 mmHg,
and 250 mmHg) and two surface tensions (72 dynes/cm and 40 dynes/cm) were used for growing
bubbles. Although the solution with the lower surface tension produced more small bubbles, 5
µm in diameter, there was no significant difference in the bubble growth rate. When the data
were compared to the theory, the theoretical dr/dt differed by a factor of 2-4. The bubbles that
grew were preexisting in the fluid; only bubbles that could be seen with the microscope were
measured for their growth rates. Some bubbles, 15 µm or less, did not grow in the time frame
observed, while other bubbles that were slightly larger, 30 µm, did grow.

INTRODUCTION


Decompression sickness (DCS) afflicts many divers and aviators. It is attributed to an
excess of nitrogen in the body that causes bubble formation because of a reduction in pressure.
For a diver, this happens during his ascent from the bottom after his task is completed, while
for an aviator it occurs during his ascent and time at altitude, before the task is completed. For
diving, DCS is prevented by avoiding certain supersaturations of nitrogen. This is done by the
use of tables that prescribe the dive profile. Even if a diver deviates from the prescribed
profile, he can slow his ascent and even recompress if necessary. The aviator cannot
avoid going to altitude, so he must always accept some risk. If he shows signs of DCS, he may not
be able to alter his profile and recompress by descending. To plan flights more safely, a model
is needed that can predict the risk of getting DCS based upon the flight profile. This model could
be used to determine how much prebreathe time would be necessary. If a model is developed and
validated, it can then be used for monitoring the risk during the flight.

Because DCS is thought to be caused by nitrogen bubbles, the model should incorporate
how these bubbles form and grow. Currently it is not clearly understood how bubbles form.
They are thought to originate from bubble nucleation sites or preexisting bubbles. Two factors
that are important for the formation of bubbles are the gas supersaturation and the surface
tension (4). Tissues which become supersaturated due to the reduction in ambient pressure
attempt to re-equilibrate by diffusion of gas out of the tissue and into the bubble, but this only
happens if there is a bubble for the gas to diffuse into. The surface tension contributes the most
to the pressure in the bubble. A high surface tension corresponds to high pressure; this hinders
bubble formation. A lower surface tension will reduce this pressure and allow bubbles to form.
The surface tension can be reduced by any number of surfactants in the body. The easiest
places for this to occur are the lungs, which are covered with surfactant and have gas exchange
taking place, and the joints, where the lubricating fluid could act as a surfactant.

After bubbles are formed, it is necessary to know how fast they will grow. This will
give information as to how long it will take for the bubbles to reach a critical size where
problems may occur. In research altitude chamber flights there is usually a latency period of
1/2 to 2 hours before a subject will develop bubbles and show symptoms of DCS. It is unclear
whether it is the number of bubbles present, the size of the bubbles, the amount of gas which
has evolved into the bubbles, or the location of the bubbles that causes DCS.

To gain a better understanding of DCS, the growth process of microbubbles will be
observed. If this growth can be accurately described, a relationship between bubbles and
DCS could perhaps be developed to assess an aviator's risk. This growth will occur at altitude
because of a pressure difference.

THEORY 

Diffusion of a gas is caused by a pressure gradient between a bubble in a fluid and the
fluid. A gradient can be caused by the warming of the fluid; the fluid is going from a saturated
state at one temperature to a supersaturated state at a higher temperature. The gradient can also
be caused by a decrease in ambient pressure. When the ambient pressure decreases, the partial
pressure of the gases in the fluid becomes larger than the ambient pressure; this causes the gas
to diffuse from the liquid into the bubble.

Several theories have been proposed to describe this process, but none of them have been
validated. Both Van Liew's (2) and Gernhardt's (4) theories relate the growth of bubbles to DCS.
Because these theories are dealing with physiological systems there are many unknowns:
perfusion, solubilities, surface tensions, tissue half times, and even how to deal with the
complexities of different growing media.

Besides the uncertainties in describing the growth, there is the question of when the
growth starts. The theories that describe growth do not describe how the bubbles form in the
first place.

If a chamber of water is used for the growing medium of the bubbles, many
uncertainties can be removed. There is no perfusion of the water; thus, there is no need for a
tissue half time. The physical constants for water are known; therefore, there is no ambiguity
in which constant should be used. If bubbles are present in this fluid, the chamber can be taken
to an altitude where the bubbles will grow.

There will be two parts to the growth of the bubbles. The first is a Boyle's law free
expansion due to the change in pressure. The next part is the growth due to diffusion of gas from
the fluid into the bubble. This phase takes significantly longer. Diffusion will be affected by the
pressure, size, and the number of bubbles, while the free expansion is due solely to the change
in pressure.

In order to describe how bubbles should grow, a theoretical derivation of the mechanics
is included. This derivation will be used for comparing how bubbles grow in a simulated
altitude chamber. In this derivation the bubbles will be assumed spherical. The volume of a
sphere is:

$$V = \frac{4\pi r^3}{3} \qquad (1)$$

When studying a bubble, it is necessary to know what the pressure in the bubble is. Three
factors make up the pressure in a bubble: the hydrostatic, or barometric, pressure, $P_h$; the
pressure due to surface tension, $2S/r$, where $S$ is the surface tension and $r$ is the radius
of the bubble; and the pressure from the tissue elasticity, $H 4\pi r^3/3$, where $H$ is the
bulk modulus of tissue elasticity. For this derivation, the pressure from tissue elasticity will
be ignored. Values of H for water were found, and when they were used in the equations they
made no significant difference. It is assumed that H for a tissue would be less than H for water,
so ignoring its effects does not alter the values given by the equations. Generally, for a DCS
study, the pressure in the bubble is divided into two parts, the pressure due to nitrogen and the
pressure due to the tissue gases (oxygen, carbon dioxide, and water vapor). For a model altitude
chamber there are no metabolic processes to regulate the tissue gases; therefore, the pressure
is:

$$P_b = P_h + \frac{2S}{r} \qquad (2)$$
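
To give a feel for the size of the surface tension term, the sketch below evaluates P_b = P_h + 2S/r for the bubble sizes and surface tensions used in this study. The unit conversion (1 mmHg = 1333.22 dynes/cm²) and the function name are assumptions of the example and are not part of the original derivation.

```python
DYN_PER_CM2_PER_MMHG = 1333.22   # 1 mmHg expressed in dyn/cm^2


def bubble_pressure_mmhg(p_h_mmhg, surface_tension_dyn_cm, radius_um):
    """P_b = P_h + 2S/r, with S in dyn/cm and r converted from micrometers to cm."""
    radius_cm = radius_um * 1e-4
    laplace_mmhg = 2.0 * surface_tension_dyn_cm / radius_cm / DYN_PER_CM2_PER_MMHG
    return p_h_mmhg + laplace_mmhg


# Bubbles of 5 um and 30 um diameter (2.5 um and 15 um radius) at 380 mmHg,
# for both surface tensions used in the experiment.
table = [(s, radius_um, bubble_pressure_mmhg(380.0, s, radius_um))
         for s in (72.0, 40.0)
         for radius_um in (2.5, 15.0)]
```

For a radius of 2.5 µm and S = 72 dynes/cm the surface tension term alone is about 432 mmHg, while for a 15 µm radius it is about 72 mmHg, so the smallest bubbles carry a much larger excess internal pressure.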

The first part of the growth is a Boyle's law expansion. Boyle's law describes the behavior of an
ideal gas moving from one state to another at a constant temperature.

$$P_1 V_1 = P_2 V_2 \qquad (3)$$

When Boyle's law is applied to a spherical bubble and equations (1) and (2) are used for P and
V, an equation which describes the free expansion is derived.

$$f(r_2) = P_{h2} + \frac{2S}{r_2} - P_{b1}\left(\frac{r_1}{r_2}\right)^3 = 0 \qquad (4)$$
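
The free-expansion condition, P_h2 + 2S/r2 - P_b1 (r1/r2)^3 = 0, has no simple closed form when the surface tension is not zero, but it can be solved numerically for r2. The following is a minimal sketch using bisection; the function name, the cgs unit conversion, and the choice of bisection are assumptions of the example and not necessarily what the author's program did.

```python
DYN_PER_CM2_PER_MMHG = 1333.22   # 1 mmHg expressed in dyn/cm^2


def boyle_expanded_radius(r1_cm, p_h1_mmhg, p_h2_mmhg, s_dyn_cm, tol=1e-12):
    """Solve f(r2) = P_h2 + 2S/r2 - P_b1 (r1/r2)^3 = 0 for r2 by bisection."""
    if p_h2_mmhg > p_h1_mmhg:
        raise ValueError("this sketch only handles a pressure drop")
    p_h1 = p_h1_mmhg * DYN_PER_CM2_PER_MMHG
    p_h2 = p_h2_mmhg * DYN_PER_CM2_PER_MMHG
    p_b1 = p_h1 + 2.0 * s_dyn_cm / r1_cm            # bubble pressure at state 1

    def f(r2):
        return p_h2 + 2.0 * s_dyn_cm / r2 - p_b1 * (r1_cm / r2) ** 3

    # f(r1) = P_h2 - P_h1 <= 0 for a pressure drop, and f > 0 for large r2,
    # so doubling the upper bound eventually brackets the single root.
    lo, hi = r1_cm, r1_cm
    while f(hi) <= 0.0:
        hi *= 2.0
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


# A 15 um radius bubble taken from 760 mmHg to 380 mmHg in water (S = 72 dyn/cm).
r2_water = boyle_expanded_radius(15.0e-4, 760.0, 380.0, 72.0)
```

With S = 0 the result reduces to r2 = r1 (P_h1/P_h2)^(1/3); a nonzero surface tension gives a slightly smaller expanded radius.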

After the Boyle's law expansion, gas will begin to diffuse into the bubble. Two equations have
been derived to describe this process, one by Gernhardt and one by Van Liew. Gernhardt's
equation is based on diffusion-limited gas transport across the bubble/fluid interface, with a
specified shell thickness.

$$\frac{dr}{dt} = \frac{\alpha D (P_t - P_b) P_s}{h\left(P_b - \frac{2S}{3r}\right)} \qquad (5)$$

$\alpha$ = Solubility of gas in the tissues
$D$ = Diffusivity
$P_t$ = Pressure of gas in the tissue
$P_b$ = Pressure of gas in the bubble
$P_s$ = Standard pressure
$h$ = Effective thickness of the diffusion barrier between the bubble and the tissue
$S$ = Surface tension
$r$ = Radius
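
Gernhardt's growth rate, dr/dt = αD(P_t - P_b)P_s / [h(P_b - 2S/3r)], can be written as a short function. This is a sketch only: the function name is illustrative, and the numerical values in the example (solubility, diffusivity, and shell thickness) are placeholders chosen for illustration, not values taken from this study.

```python
def gernhardt_drdt(r, p_t, p_b, alpha, diffusivity, h, s, p_s):
    """dr/dt = alpha * D * (P_t - P_b) * P_s / (h * (P_b - 2S/(3r))).

    Use one consistent unit system, for example cgs: r and h in cm, pressures
    in dyn/cm^2, S in dyn/cm, D in cm^2/s, and alpha as volume of gas (at
    standard pressure) per volume of liquid per unit pressure.
    """
    return alpha * diffusivity * (p_t - p_b) * p_s / (h * (p_b - 2.0 * s / (3.0 * r)))


# Illustrative placeholder values in cgs units (not taken from this study).
ATM = 1.01325e6                 # standard pressure P_s, dyn/cm^2
MMHG = 1333.22                  # dyn/cm^2 per mmHg
radius = 15.0e-4                # 15 um radius, in cm
surface_tension = 72.0          # dyn/cm
p_bubble = 380.0 * MMHG + 2.0 * surface_tension / radius   # P_b = P_h + 2S/r
example_rate = gernhardt_drdt(
    r=radius, p_t=760.0 * MMHG, p_b=p_bubble,
    alpha=0.015 / ATM, diffusivity=2.0e-5, h=1.0e-3,
    s=surface_tension, p_s=ATM)
```

The sign of the result follows the sign of P_t - P_b: the bubble grows only while the gas pressure in the surrounding fluid exceeds the pressure inside the bubble.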

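While the equation derived by Van Liew describes the same process, it has a slightly different
form.

$$\frac{dr}{dt} = -(\alpha D P_s)\left(1 - \frac{P_t}{P_b}\right)\left(\frac{1}{r} + \lambda\right) \qquad (6)$$

where $\lambda = \sqrt{(\alpha_b k Q)/(\alpha_t D)}$.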
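
Van Liew's form can be sketched in the same way. The version below takes the rate as dr/dt = -(αDP_s)(1 - P_t/P_b)(1/r + λ), with the sign chosen so that a bubble grows when P_t exceeds P_b, in agreement with Gernhardt's equation. The symbols inside λ are not defined in the text; the sketch assumes Q is a perfusion term, so that λ is zero in an unperfused medium such as the water chamber. The function names are illustrative.

```python
import math


def van_liew_lambda(alpha_b, k, q, alpha_t, diffusivity):
    """lambda = sqrt(alpha_b * k * Q / (alpha_t * D)); zero when Q is zero."""
    return math.sqrt(alpha_b * k * q / (alpha_t * diffusivity))


def van_liew_drdt(r, p_t, p_b, alpha, diffusivity, p_s, lam=0.0):
    """dr/dt = -(alpha * D * P_s) * (1 - P_t/P_b) * (1/r + lambda)."""
    return -alpha * diffusivity * p_s * (1.0 - p_t / p_b) * (1.0 / r + lam)


# With no perfusion (Q = 0, an assumption for the water chamber) lambda vanishes.
lam_water = van_liew_lambda(alpha_b=1.0, k=1.0, q=0.0, alpha_t=1.0, diffusivity=2.0e-5)
```

With λ = 0 this expression equals Gernhardt's rate evaluated with S = 0 and a shell thickness h equal to the bubble radius, which is one way to see that the two equations describe the same process.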

Because gas is diffusing into the bubble, the pressure of the gas in the tissue will be
decreasing. This process can be described by doing a mole balance:

(moles of gas in tissue) state 1 = (moles of gas in tissue) state 2 + (moles of gas diffused into bubbles).

Using the mole balance and the ideal gas law, an equation which describes the change in pressure
is derived.

$$P_{t2} = P_{t1} - \frac{N P_b \, 4\pi (r_2^3 - r_1^3)}{3 \alpha P_s} \qquad (7)$$

Equations (4), (5), (6), and (7) can then be used in a computer program to simulate the
growth of bubbles at altitude.
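
A minimal sketch of such a program is given below. It is not the author's program: it assumes simple Euler time stepping, uses Gernhardt's rate dr/dt = αD(P_t - P_b)P_s / [h(P_b - 2S/3r)] with P_b = P_h + 2S/r, lowers the dissolved gas pressure after each step with the mole balance P_t2 = P_t1 - N P_b 4π(r2³ - r1³)/(3αP_s), and takes N to be the number of bubbles per unit volume of fluid. The starting radius is assumed to be the radius after the Boyle's law expansion, and the default solubility, diffusivity, and shell thickness are illustrative placeholders.

```python
import math

DYN_PER_CM2_PER_MMHG = 1333.22
ATM = 1.01325e6                      # standard pressure P_s in dyn/cm^2


def simulate_growth(r0_cm, p_h_mmhg, p_t0_mmhg, s_dyn_cm, n_per_cm3,
                    alpha=0.015 / ATM, diffusivity=2.0e-5, h_cm=1.0e-3,
                    dt=0.1, t_end=600.0):
    """Euler integration of the diffusion-limited growth rate.

    alpha, diffusivity and h_cm are illustrative placeholder values.
    Returns a list of (time_s, radius_cm, p_t_dyn_cm2) tuples.
    """
    p_h = p_h_mmhg * DYN_PER_CM2_PER_MMHG
    p_t = p_t0_mmhg * DYN_PER_CM2_PER_MMHG
    r, t = r0_cm, 0.0
    history = [(t, r, p_t)]
    while t < t_end:
        p_b = p_h + 2.0 * s_dyn_cm / r                       # bubble pressure
        drdt = alpha * diffusivity * (p_t - p_b) * ATM / (
            h_cm * (p_b - 2.0 * s_dyn_cm / (3.0 * r)))       # Gernhardt rate
        r_new = r + drdt * dt
        if r_new <= 0.0:                                     # bubble has dissolved
            break
        # Mole balance: gas entering N bubbles per unit volume lowers P_t.
        p_t -= (n_per_cm3 * p_b * 4.0 * math.pi * (r_new ** 3 - r ** 3)
                / (3.0 * alpha * ATM))
        r, t = r_new, t + dt
        history.append((t, r, p_t))
    return history


# A 15 um radius bubble at 380 mmHg in water equilibrated at 760 mmHg.
growth = simulate_growth(15.0e-4, 380.0, 760.0, 72.0, n_per_cm3=0.0, t_end=10.0)
```

Under these assumptions a 2.5 µm radius bubble at the same pressures shrinks instead of growing, because its surface tension term raises P_b above P_t.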

METHODS 

Figure one shows a general set up of the equipment used, which includes a model altitude
chamber, a microscope, a microscope stand, a light, a camera, a T.V., and a VCR. The model
altitude chamber was used to reduce the pressure. Inside the chamber was the fluid to be taken
up to altitude. In this fluid there was a 100 µm wire that was used as a reference for making
measurements. The microscope allowed for the visualization of the bubbles as they grew.
Attached to the microscope was a camera, whose output went to the T.V. and VCR.

Fig. 1 Experimental apparatus used. Pressure device is on left,
connected to the chamber of water. Microscope with attached
camera is in center. The microscope was used to focus on a
preexisting bubble, and the pressure was dropped while the
camera recorded it.


To record the growth rate of the bubbles the following procedure was used. The chamber
was filled with the fluid to be taken to altitude. This fluid was either 100% distilled H2O or a
mixture of 80% distilled H2O and 20% acetone. The acetone was added to reduce the surface
tension of the water from 72 dynes/cm to 40 dynes/cm. The microscope was then focused on a
bubble which would be followed as it was taken to altitude. It should be noted that all bubbles
measured were preexisting bubbles stuck to the wire. The pressure in the chamber was then
dropped to one of three pressures: 250 mmHg, 380 mmHg, or 500 mmHg. The pressure
drop was a step change. This decrease in pressure set up the pressure gradient for air to diffuse
from the fluid into the bubble, causing the bubble to grow. This bubble growth was recorded on
the VCR. The bubbles were left to grow as long as possible, usually less than 30 minutes. As
the bubbles became larger, they either moved out of the field of view or they pulled off the wire
and floated away.

After the growth was recorded, the diameter as a function of time was measured. The
tape was played back and digitized into a Macintosh computer. The computer could grab images
from the tape at a rate of one every 1/30th of a second. For the first two minutes of growth an
image was taken every 1-2 seconds. After this time period, an image was taken every 6, 10, or 20
seconds.


A typical image is shown in figure 2. To measure the diameter, a scale was first set up using
the 100 µm wire: a line was drawn across the wire to calibrate the image. Then a straight line
was drawn across the bubble and measured. Care was taken to ensure that this line was a
diameter. To make measuring easier, the image could be thresholded, or image processing
software could trace the edges. The resulting image was only black and white, which allowed
the edges to be easily detected (figure 3). The diameter was not the only possibility for
measuring the size. An outline of the bubble could be traced and its length measured; this would
give the circumference, which is related to the radius by C = 2πr. The area inside the outline
could be measured; the area is related to the radius by A = πr². Another method to measure the
radius could have been to calculate the equation of a circle by choosing three points on the edge
of the bubble. The timer from the VCR was used as the time base (labeled time counter in figure 2).
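
The alternative size measurements described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the pixel coordinates, and the 25-pixel wire width are hypothetical and do not come from the original image analysis software.

```python
import math

def radius_from_circumference(circumference):
    """Invert C = 2*pi*r."""
    return circumference / (2.0 * math.pi)

def radius_from_area(area):
    """Invert A = pi*r^2."""
    return math.sqrt(area / math.pi)

def circle_through(p1, p2, p3):
    """Return (cx, cy, r) of the circle through three non-collinear points."""
    (x1, y1), (x2, y2), (x3, y3) = p1, p2, p3
    d = 2.0 * (x1 * (y2 - y3) + x2 * (y3 - y1) + x3 * (y1 - y2))
    if d == 0:
        raise ValueError("points are collinear")
    s1, s2, s3 = x1 * x1 + y1 * y1, x2 * x2 + y2 * y2, x3 * x3 + y3 * y3
    cx = (s1 * (y2 - y3) + s2 * (y3 - y1) + s3 * (y1 - y2)) / d
    cy = (s1 * (x3 - x2) + s2 * (x1 - x3) + s3 * (x2 - x1)) / d
    return cx, cy, math.hypot(x1 - cx, y1 - cy)

def pixels_to_um(length_px, wire_px, wire_um=100.0):
    """Scale a pixel length using the 100 um reference wire."""
    return length_px * wire_um / wire_px

# Hypothetical edge points (pixels) picked on a bubble outline.
cx, cy, r_px = circle_through((8.0, 4.0), (3.0, 9.0), (-2.0, 4.0))
# Hypothetical calibration: the wire spans 25 pixels in the image.
print("diameter in um:", pixels_to_um(2.0 * r_px, wire_px=25.0))
```

The three-point method needs only edge locations, so it could still be used when part of the bubble is hidden behind the wire.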

Fig. 2 A typical image after it is digitized into the computer. The vertical band is the
100 µm wire and the bubble is growing on the right side of the wire. The time counter is output
from the VCR. No enhancements have been made to this image. Generally the images appear
much cleaner on the computer screen.

The data for the diameter was then entered into the computer, and a Nelder-Mead
simplex algorithm, a non-linear optimization routine, was used to fit the data to an
exponential equation of the form:

D_i(t) = C_1 (1 - \exp(-C_2 t)) + D_0    (8)

D_i = Diameter of the bubble
C_1 = Constant
C_2 = Constant
t = Time
D_0 = Initial diameter

To get an equation for the radius as a function of time, equation (8) should be divided by 2.
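
A fit of this kind can be sketched as follows. This is a minimal pure-Python Nelder-Mead written for illustration, not the routine used in the study; the synthetic data, starting guess, step sizes, and the choice to hold D_0 fixed at the first measured point are all assumptions.

```python
import math

def model(t, c1, c2, d0):
    """Equation (8): diameter at time t."""
    return c1 * (1.0 - math.exp(-c2 * t)) + d0

def make_sse(times, diameters, d0):
    """Sum of squared errors as a function of (c1, c2); d0 is held fixed."""
    def sse(p):
        c1, c2 = p
        if c2 <= 0.0:  # keep the rate constant positive
            return math.inf
        return sum((model(t, c1, c2, d0) - d) ** 2
                   for t, d in zip(times, diameters))
    return sse

def nelder_mead(f, start, steps, iterations=1000):
    """Basic simplex search: reflect, expand, contract inside, or shrink."""
    n = len(start)
    simplex = [list(start)]
    for i in range(n):
        vertex = list(start)
        vertex[i] += steps[i]
        simplex.append(vertex)
    for _ in range(iterations):
        simplex.sort(key=f)
        best, worst = simplex[0], simplex[-1]
        centroid = [sum(v[i] for v in simplex[:-1]) / n for i in range(n)]

        def along(k):
            # Point on the line through the centroid, away from the worst vertex.
            return [c + k * (c - w) for c, w in zip(centroid, worst)]

        reflected = along(1.0)
        if f(reflected) < f(best):
            expanded = along(2.0)
            simplex[-1] = expanded if f(expanded) < f(reflected) else reflected
        elif f(reflected) < f(simplex[-2]):
            simplex[-1] = reflected
        else:
            contracted = along(-0.5)
            if f(contracted) < f(worst):
                simplex[-1] = contracted
            else:
                simplex = [best] + [[(b + x) / 2.0 for b, x in zip(best, v)]
                                    for v in simplex[1:]]
    return min(simplex, key=f)

# Synthetic, noise-free data generated from one set of coefficients (illustrative).
true_c1, true_c2, d0 = 526.52, 0.0064305, 267.50
times = list(range(0, 1801, 12))
diameters = [model(t, true_c1, true_c2, d0) for t in times]

fit = nelder_mead(make_sse(times, diameters, d0),
                  start=(400.0, 0.003), steps=(100.0, 0.002))
print("fitted C1, C2:", fit)
```

Because C_1 and C_2 differ by several orders of magnitude, the initial simplex steps are scaled separately for each parameter.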

The data was then checked against Van Liew's and Gernhardt's equations. A computer
program was used to simulate conditions in the lab. The program incorporates the ability for
gas to wash out of the tissue based on a perfusion model. However, there was no perfusion in
the lab; to remove its effects, the tissue half time was set large so that perfusion could not take
place during the time frame being observed. The values for inputs to the program are as
follows:

α = 0.0125 1/atm
D = 0.00002 cm²/s (3)
Surface tension = 72.0 dynes/cm (3)
% inert breathing gas = 100%
Initial PN2 = 760 mmHg
Initial Ph = 760 mmHg
Half time = 100000 min
Initial radius = 0.5*D0 cm
Time step = 12 sec

By setting the initial PN2 to 760 mmHg, air became the gas involved in the diffusion process
and not just nitrogen. The data file, pressure.dat, contains some of the constants and the
pressure profile. The program reads in the current pressure and checks whether the pressure
has changed. If there was a change, a Boyle's law expansion was calculated. The
difference in pressure set up the gradient for gas to diffuse into the bubble. A new radius was
then calculated based on both rate equations. Because gas diffused into the bubble, the tissue
pressure dropped. These calculations were performed every 12 seconds and the output
compared to the data. The profile used was the same as that in the lab. To compare the predicted
data with the measured data, the radius was multiplied by two and converted from cm to µm.


RESULTS 


The lab data was fit to an exponential curve. The time constant of the exponential was
averaged for trials at each pressure (tables 1, 2, and 3). Without Acetone, the surface tension is
72 dynes/cm; with Acetone, it is 40 dynes/cm. A t-test was performed on
the data to see whether the time constants with and without Acetone differed significantly. As
can be seen from the tables, there was no significant difference in the time constant of the data
when Acetone was added. The full set of coefficients is in appendix A. The coefficients represent
the data after the Boyle's law expansion was completed. The first 5 seconds of D_i(t) were removed
before the equations were fit to the data.


Table 1. 250 mmHg with and without Acetone.

No Acetone:    C2 = .004890;   SD = .0024837
With Acetone:  C2 = .004857;   SD = .0018978
Student t = .0253; No significant difference

Table 2. 380 mmHg with and without Acetone.

No Acetone:    C2 = .0016863;  SD = .0002998
With Acetone:  C2 = .0022521;  SD = .0007397
Student t = 1.825; No significant difference

Table 3. 500 mmHg with and without Acetone.

No Acetone:    C2 = .0013222;  SD = .0004793
With Acetone:  C2 = .0012631;  SD = .0004835
Student t = .021; No significant difference
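
The comparison behind these tables can be reproduced from the per-trial C2 values in appendix A. The sketch below assumes the unequal-variance form of the two-sample t statistic, t = (mean difference) / sqrt(s1²/n1 + s2²/n2); the report does not state which form was used, but this one agrees with the tabulated values at 250 and 380 mmHg.

```python
import math
import statistics

def t_statistic(a, b):
    """Two-sample t statistic with separate (unpooled) variances."""
    se = math.sqrt(statistics.variance(a) / len(a) +
                   statistics.variance(b) / len(b))
    return (statistics.mean(b) - statistics.mean(a)) / se

# C2 values per trial, copied from appendix A (Tables A1 and A2).
no_acetone = {
    250: [.0064305, .0070759, .0076308, .0037429, .0016308, .0028302],
    380: [.002198, .0014717, .0014808, .0016909, .00159],
}
with_acetone = {
    250: [.0023253, .0075987, .0052597, .006105, .0033635, .0044943],
    380: [.0018978, .0026882, .0030355, .0025543, .0029063, .0010456, .001637],
}

t_values = {p: abs(t_statistic(no_acetone[p], with_acetone[p]))
            for p in no_acetone}
for pressure, t in t_values.items():
    print(pressure, "mmHg: |t| =", round(t, 4))
```

Applied to the 500 mmHg trials, the same formula gives a value of roughly 0.2, which is also far from significance.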


From observation it was noted that the density of bubbles that grew was 5-10 bubbles
per ml. When the surface tension was decreased, many small bubbles, 5-25 µm, were
present. These small bubbles constantly floated in the fluid; however, they never stuck to the
wire. Bubbles less than 15 µm in diameter did not grow in the time frame observed. However,
bubbles of 30 µm diameter did grow. With the surface tension reduced, bubbles of <5 µm
persisted in the fluid. This was seen by taking a sample of the Acetone/water mixture and
looking at it under a high power microscope. With plain water these 5 µm bubbles were rarely
seen.


Small bubbles formed on the surfaces of larger bubbles. A bubble of 300 µm diameter
would have as many as 50 smaller bubbles on its surface. These bubbles did grow, but not at the
same rate as the larger one to which they were attached.

When the computer program was run, neither equation matched the data. If h, the
barrier length in Gernhardt's equation, was replaced with r, then both equations gave similar
answers. The prediction for the Boyle's law expansion was similar to the expansion seen in the
lab, although the predicted expansion was smaller than what was observed. Because
diffusion was taking place while the bubble was expanding, a larger expansion than expected was
observed. The predicted growth rate was off by a factor of approximately 2, 3, or 4 (figure 4).
When dr/dt was multiplied by this factor, the equations more closely matched the data. However,
the equations still predicted growth that was too fast for the smaller bubbles.
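
The Boyle's law expansion step and the unit conversion used for the comparison can be illustrated with a short sketch. It assumes an isothermal ideal gas whose pressure equals the ambient pressure, so surface tension and water vapor are neglected; the simulation program itself may have included such terms. The function names are hypothetical.

```python
def boyle_radius(radius, p_old, p_new):
    """Radius after an instantaneous pressure change, from P*V = constant.

    Volume scales as p_old / p_new, so the radius scales as its cube root.
    """
    return radius * (p_old / p_new) ** (1.0 / 3.0)

def diameter_um(radius_cm):
    """Convert a radius in cm to a diameter in micrometers."""
    return 2.0 * radius_cm * 1.0e4

# Step changes from 760 mmHg to the three pressures used in the lab.
r0_cm = 0.005  # hypothetical initial radius (100 um diameter)
for p in (500.0, 380.0, 250.0):
    print(p, "mmHg ->", round(diameter_um(boyle_radius(r0_cm, 760.0, p)), 1), "um")
```

Under these assumptions a step from 760 to 380 mmHg doubles the volume, which increases the diameter by a factor of about 1.26.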

DISCUSSION 

Neither  theory  accurately  predicts  bubble  growth  for  the  lab.  By  replacing  the  effective 
diffusion  barrier  with  the  radius,  both  methods  predict  the  same  result.  If  the  theoretical  dr/dt 
is multiplied by a factor of 2 to 4, the equations match the data. Van Liew's equation does not
leave  room  for  adding  such  a  factor  for  the  lab.  However,  the  h  can  be  defined  as  a  function  of 
the  radius.  For  example,  at  250  mmHg,  h  could  equal  .25*r.  This  allows  the  theoretical 
equation  to  follow  the  data  for  the  middle  sized  bubbles.  Even  with  this  correction  the  theory 
does  not  work  for  the  smaller  sized  bubbles  that  grow.  Since  dr/dt  is  increased  with  smaller 
radii,  a  small  bubble  is  predicted  to  grow  very  rapidly.  This  is  not  the  case.  A  small  bubble 
grows  at  approximately  the  same  rate  as  a  larger  bubble. 

The equations do not describe why some bubbles do not grow, or grow very slowly. If a
bubble radius of 7 µm is put into the equation, then it will grow rather large in a couple of
minutes. A 7 µm bubble never grew while under observation in the lab. There appeared to be a
threshold for bubbles to grow. In trials where there was not a preexisting bubble to focus on, the
chamber was taken to altitude and left for as long as possible, 1-2 or more hours. In these
trials, larger bubbles were not found while the smaller ones were present. A 30 µm diameter
bubble grew while a 15 µm diameter bubble either did not grow or grew very slowly.
Somewhere in between these two sizes could be a cut off, or a gray area where bubbles will
either grow or not grow. If there is a cut off, it is probably around 20 µm in diameter. A
possible explanation is that the pressure due to surface tension at this size is too large for a
bubble to grow.
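
The size of this surface tension pressure can be estimated with the Young-Laplace relation for a spherical bubble, ΔP = 2σ/r, where σ is the surface tension and r the radius. This is a standard result rather than an equation quoted from this report, and the function name below is hypothetical. The sketch evaluates it for the diameters and surface tensions mentioned in the text.

```python
DYNES_PER_CM2_PER_MMHG = 1333.22  # 1 mmHg expressed in dynes/cm^2

def laplace_pressure_mmhg(diameter_um, surface_tension_dyn_cm):
    """Excess pressure inside a spherical bubble, 2*sigma/r, in mmHg."""
    radius_cm = 0.5 * diameter_um * 1.0e-4
    return 2.0 * surface_tension_dyn_cm / radius_cm / DYNES_PER_CM2_PER_MMHG

for sigma in (72.0, 40.0):           # water, and water with Acetone
    for d in (5.0, 15.0, 20.0, 30.0):  # bubble diameters in um
        print(sigma, "dynes/cm,", d, "um:",
              round(laplace_pressure_mmhg(d, sigma), 1), "mmHg")
```

At 72 dynes/cm a 20 µm bubble carries an excess pressure of about 108 mmHg, which is not small compared with the pressure drops used in the lab, and the excess doubles each time the diameter is halved.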

When the surface tension was reduced, bubbles of 5 µm diameter existed, while they
were rare with the higher surface tension. This supports the theory that the surface tension is
the main inhibitor of bubble formation. Bubbles will tend to form in areas of reduced surface
tension. Blood normally has a surface tension of 45-50 dynes/cm, physiological lipids have a
surface tension of 20-30 dynes/cm, and in the lungs the surface tension can be as low as
5 dynes/cm. This allows the bubbles to form, but how they become large enough to grow is
unknown. The addition of energy, heat or kinetic, to the system may be a possibility. If the
ambient temperature rises, there will be heat flow into the system that could cause the bubble
to grow. Kinetic energy was added to the system by striking the side of the chamber. The
spontaneous generation of hundreds of bubbles was the result. Many bubbles were too small to
grow; however, others were of adequate size. In humans, this type of kinetic energy could arise
from movement. Motion causes stress across the joints, and the energy produced when joints
impact can cause bubble formation. In test subjects, there was a higher incidence of DCS when
the subject exercised.

Smaller bubbles always formed on the surface of the larger bubbles. While these
bubbles were of a size that would normally not grow, <15 µm, they did grow. The difference
might be that diffusion is now from a gas to a gas, one bubble to another. This could be a way
that bubbles grow in the body. Small bubbles could be attached to red blood cells, and grow
through diffusion of gas from the cells to the bubble. These bubbles were not measured for their
growth rate because of their nonspherical shape and the difficulty in focusing on them. A
possible reason for the existence of these bubbles is that they were in the fluid, and as the
larger bubble grew it began to occupy the space where these smaller bubbles existed. Another
possibility is that the smaller bubbles floated by and were attracted to the larger bubble; this
was observed on several occasions. A third possibility is that the smaller bubbles pinched off of
the larger bubble in an effort to increase the surface area to volume ratio to continue growth.

Overall the bubbles exhibit three phases of growth. The first is the rapid Boyle's law
expansion. The second is a phase of linear growth, and the third is a decaying rate of growth.
The gas entering the bubble decreases the gas concentration in the fluid. This decreases the
pressure gradient and the amount of gas available for diffusion, which causes the third phase. A
representation of the third phase is shown in equation (7). The ultimate size that a bubble can
reach is dependent on this equation; thus it is dependent on the number of bubbles in the fluid.
If there are many bubbles, they cannot grow very large because they compete with each
other for the gas. The rate of linear growth is dependent on the altitude. The growth at 500
mmHg and 380 mmHg is significantly less than the growth at 250 mmHg. The relationship
between growth rate and altitude is not linear. This is seen by comparing the time
constants, C2, from the curve fits. There is only a difference of .00036 between 500 mmHg
and 380 mmHg, while the difference between 380 mmHg and 250 mmHg is .0032, almost a
factor of ten difference for an almost identical change in pressure.
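
The arithmetic behind this comparison can be checked directly from the mean C2 values without Acetone in tables 1 to 3. The short sketch below is only a worked calculation; the variable names are illustrative.

```python
# Mean C2 without Acetone, from tables 1, 2, and 3.
c2 = {250: 0.004890, 380: 0.0016863, 500: 0.0013222}

diff_500_to_380 = c2[380] - c2[500]   # pressure change of 120 mmHg
diff_380_to_250 = c2[250] - c2[380]   # pressure change of 130 mmHg
ratio = diff_380_to_250 / diff_500_to_380

print("500 -> 380 mmHg:", round(diff_500_to_380, 5))
print("380 -> 250 mmHg:", round(diff_380_to_250, 5))
print("ratio of the two changes:", round(ratio, 1))
```

The two pressure steps differ by only 10 mmHg, yet the change in C2 across the lower-pressure step is several times larger, which is the nonlinearity described above.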

CONCLUSION 

Growth of microbubbles is important to the study of DCS. Symptoms of DCS are
attributed to the existence of these bubbles. If an understanding of DCS is desired, then
how bubbles form and grow must be understood. The growth rate of bubbles was
studied in the lab, and an attempt was made to correlate the data with the existing theory.

If bubbles form from bubble nuclei, how do they grow? Small bubbles did not
grow. This suggests that there is some other catalyst for bubbles to reach a size where they will
grow. This catalyst could be the link in describing the latency period between the time when a
subject goes to altitude and the time when bubbles are seen in his body. In the laboratory, the
time course for a bubble to grow to the size commonly found in the inferior vena cava (IVC),
100 µm, was only a couple of minutes. A subject can be taken to altitude and bubbles may not
be seen for a couple of hours. Clearly there may be a process that prevents these bubbles from
growing.

To further develop a model that predicts the risk of DCS, it is necessary to know how a
bubble forms. If this information could be discovered, it would be known what types of flights
would lead to bubble formation. A profile could then be developed that would manipulate the
formation process to hinder bubble formation and decrease the risk of DCS. An important factor
in bubble formation is the surface tension. Low surface tension allows bubbles to form easily
and exist for a long period of time. Currently bubbles are seen in the IVC but their origin is
unknown. They are too big, 50-100 µm, to have been transported through small capillary beds. A
probable scenario is that the bubbles form in an area of low surface tension, the lungs or
possibly the joints. These bubbles will be <15 µm; they can be transported through the
capillaries and into larger vessels where some other process will allow them to grow.

APPENDIX A

This appendix contains all the coefficients for the data. The coefficients are for the
equation:

D_i(t) = C_1 (1 - \exp(-C_2 t)) + D_0

The coefficients are for each trial. The pressure was known, and D_0 was taken as the point after
the Boyle's law expansion.

Table A1

Values for the results of the curve fits to the data when no Acetone was added.

Pressure (mmHg)    C1         C2          D0
250                526.52     .0064305    267.50
250                494.39     .0070759    65.00
250                567.27     .0076308    65.00
250                850.38     .0037429    213.22
250                1230.71    .0016308    289.09
250                1070.15    .0028302    238.18
380                627.63     .002198     90.70
380                735.60     .0014717    125.58
380                706.82     .0014808    102.13
380                534.13     .0016909    129.79
380                753.28     .00159      114.00
500                1160.1     .0007533    87.5
500                239.64     .0012018    23.08
500                536.7      .0012566    241.82
500                393.76     .0020815    63.64
500                768.14     .0013178    108.00
Table A2

Values for the results of the curve fits to the data when Acetone was added.

Pressure (mmHg)    C1         C2          D0
250                1300.7     .0023253    318.18
250                500.94     .0075987    278.18
250                527.38     .0052597    339.62
250                968.30     .006105     170.91
250                1150.8     .0033635    174.55
250                991.05     .0044943    194.55
380                989.23     .0018978    200.00
380                676.80     .0026882    143.40
380                813.01     .0030355    249.09
380                1013.3     .0025543    207.27
380                773.00     .0029063    207.27
380                1011.90    .0010456    187.27
380                980.84     .001637     140.00
500                898.84     .0010972    135.85
500                607.19     .0015083    125.71
500                1076.1     .0010087    169.81
500                652.52     .0019992    178.18
500                992.55     .0007844    264.00
500                686.18     .0017099    136.21
500                1053.3     .00073418   212.23

ABSTRACT

Whispering is a common mode of communication which has been overlooked in current speech
synthesis research. Whispered speech is less affected by the complications inherent in voiced speech
such as dynamic source-tract interaction, variability in phonation modes, and source-based speaker
dependent differences. These properties simplify the analysis/synthesis task. Investigation of
whispered speech should provide insights about vocal-tract dynamics without the complications of
source dynamics and source-tract interactions. Much can also be learned from the comparison of
whispered speech and its phonated speech counterparts. This document illustrates some of the
potential of whispered speech synthesis research and describes the initial stage of our investigation
of whisper synthesis: the development of an analysis/resynthesis system for whispered speech. The
basic system consists of a formant tracking algorithm interfaced to a formant synthesizer.
Performance results are fair, although not as good as desired. Complications encountered during the
development process are discussed along with potential solutions and inherent limitations of the

ANALYSIS AND SYNTHESIS OF WHISPERED SPEECH USING A FORMANT SYNTHESIZER

Edward  L.  Riegelsberger 

1.  INTRODUCTION 

Whispering is a common mode of communication which has been overlooked in current speech
synthesis  research.  Whispered  speech  is  less  affected  by  the  complications  inherent  in  voiced  speech 
such  as  dynamic  source-tract  interaction,  variability  in  phonation  modes,  and  source-based  speaker 
dependent  differences.  Whispered  speech  typically  is  depicted  as  a  white-noise  source  driving  the 
vocal-tract  filter.  The  nature  of  this  model  simplifies  the  analysis/synthesis  task  by  allowing  focus 
to  be  placed  on  vocal-tract  dynamics  without  the  complications  of  source  dynamics  and  source-tract 
interactions. 

The  long-term  goal  of  this  research  pursuit  is  the  study  of  the  synthesis  of  whispered  speech.  We 
expect  that  by  studying  the  synthesis  of  whispered  speech,  knowledge  will  be  gained  that  can  be 
applied  to  the  synthesis  of  speech  in  general. 

This document primarily reports progress on the first stage of research: developing a
frame-based speech analysis/resynthesis system designed for whispered speech. The first section
provides background information about whispered speech and gives a review of whispered speech
research. Some of our motivations are described here along with potential areas of exploration.
The remainder of the report describes the design of a speech analysis/resynthesis system for
whispered speech.

II. WHISPERED SPEECH

A. Background

Whispered sounds are produced by turbulence resulting from air rushing past the slightly open
glottis. During a whisper, the glottis tenses, forming a triangular opening through which turbulence
is generated. Physiological studies report that the glottal opening is larger during whispered speech
than during voiced speech and that the vocal folds during whispered speech are seen to move very
slightly or not at all [1]. Acoustic studies have found whispered speech to have a flatter spectrum
than voiced speech, with a level power spectrum between 200 and 2000 Hz [2]. Whispering is an
inefficient speaking mode, and is limited in the number of words that can be whispered in a single
breath. A loud whisper's amplitude is typically 20 dB less than that of conversational speech, and
only slight variations in intensity are possible.

Previous research on whispered speech has primarily been concerned with perception. Kallail
and Emanuel [3] [4] found the identification of whispered vowels to be 65% for males and 64% for
females. This is about 15% less than for normally phonated vowels. Tartter [5] obtained a 64%
overall identification accuracy for 18 consonants (both normally voiced and normally unvoiced).
Tartter also reported that whispered speech contains reliable cues to place and manner of
articulation, and somewhat less prominent cues to voicing [5]. It also has been demonstrated that
whispered speech is sufficient to perform sex discrimination and that listeners can discriminate
between two speakers from their whispered speech [6].

Past studies have repeatedly observed that whispered phonemes have higher formants than their
voiced counterparts [3]. Kallail and Emanuel found statistically significant differences in first formant
frequencies between whispered and phonated vowels. Similar trends were observed in F2 and F3 for
certain vowels. It is suspected that the glottis, being slightly more open during whispered than
during phonated speech, produces the constant rise in formant frequencies. This would help to
explain why the effect is most prominent for lower formants, which are more affected by tracheal
coupling. A quantitative explanation of these effects remains to be given.

B. Motivation

Little has been reported on the synthesis of whispered speech, although the simplifying
assumptions that can be made about whispered speech make it an attractive subject for synthesis.
Because the waveform is non-periodic, there is no fundamental frequency, open-quotient, or other
voice source variable. There are no voiced/unvoiced/mixed decisions or alternative modes of phonation.

The way in which the turbulence source varies spectrally for different modes and places of
articulation is not precisely known. If we assume that the spectral variation is small, and that the
turbulence spectrum is generally flat (white noise), then existing spectral estimation techniques can
be applied to model the "true" vocal-tract transfer function. Even if the aspiration noise is not
white, the assumption of a non time-varying source offers a new way of studying the dynamics of the
vocal-tract. Knowledge of vocal-tract dynamics independent of source dynamics and source-tract
interactions is valuable to voiced speech analysis and synthesis.

Additional insights applicable to synthesis should be obtainable from the comparison of whispered
utterances to their voiced counterparts. Such comparisons should illustrate the effects of source-tract
interaction and tracheal coupling on voiced utterances. It has been suggested that some speaker
dependent characteristics may become apparent through mappings between phonated and whispered
speech. As noted by Kallail and Emanuel [3], "an understanding of the formant differences between
whispered and phonated vowels might aid in the development of optimally effective methods for
identifying certain speaker characteristics (e.g. sex) from speech samples."

We plan on exploring many of the issues introduced above using synthesis techniques. First,
though, a synthesis method must be chosen and a reliable technique for analyzing whispered speech
in terms of the synthesizer parameters must be developed. Automated speech analysis in this manner
is not a trivial task. The remainder of this document describes the first stage of our investigation of
whisper synthesis: the development of an analysis/resynthesis system for whispered speech.


III.  THE  FORMANT  SYNTHESIZER 

Our present objective is to create a system which analyzes speech and generates the synthesizer
parameters necessary to reproduce the original utterance. Such a system is correctly labelled a
speech encoder/decoder. Substantial effort has been devoted to speech encoder/decoders, but with
the objective of compact and robust representations for communication purposes. For synthesis
applications, the encoder/decoder system should encode speech in terms of acoustically and
articulatorily significant features. Meaningful speech features are essential to the effective design of
rule-based synthesizers. Additionally, these meaningful features can be used to better understand
speech production and improve speech synthesis.

The formant synthesizer is a popular synthesis tool controlled by parameters that represent
acoustic and articulatory features. Formant synthesizers are based on the source-filter theory of
speech production and come in two different varieties: cascade and parallel. Both comprise a number
of resonators which correlate with formants observed in speech. In the cascade formant synthesizer,
resonators are arranged in a serial manner to jointly define an all-pole filter. In a parallel formant
synthesizer, resonators are arranged in parallel and a gain term is associated with each resonator.
The parallel arrangement results in a pole-zero transfer function. Clearly, the parallel formant
synthesizer is more versatile in that it can produce a greater variety of spectral representations.
However, the cascade formant synthesizer requires fewer controlling parameters and its all-pole
characteristic is sufficient to describe a majority of speech types.
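
The cascade arrangement can be illustrated with a standard two-pole digital resonator of the kind commonly used in formant synthesizers. This is a minimal sketch, not the synthesizer's own code; the formant values and the 10 kHz sampling rate are hypothetical.

```python
import math

def resonator_coeffs(freq_hz, bw_hz, fs_hz):
    """Coefficients of y[n] = a*x[n] + b*y[n-1] + c*y[n-2] with unity DC gain."""
    t = 1.0 / fs_hz
    c = -math.exp(-2.0 * math.pi * bw_hz * t)
    b = 2.0 * math.exp(-math.pi * bw_hz * t) * math.cos(2.0 * math.pi * freq_hz * t)
    a = 1.0 - b - c
    return a, b, c

def resonate(signal, coeffs):
    """Run one resonator over a list of samples."""
    a, b, c = coeffs
    y1 = y2 = 0.0
    out = []
    for x in signal:
        y = a * x + b * y1 + c * y2
        out.append(y)
        y2, y1 = y1, y
    return out

def cascade(signal, formants, fs_hz):
    """Pass the signal through one resonator per (frequency, bandwidth) pair."""
    for freq_hz, bw_hz in formants:
        signal = resonate(signal, resonator_coeffs(freq_hz, bw_hz, fs_hz))
    return signal

# Hypothetical formants (Hz) for a neutral vowel, sampled at 10 kHz.
formants = [(500.0, 60.0), (1500.0, 90.0), (2500.0, 150.0)]
step_response = cascade([1.0] * 20000, formants, 10000.0)
print("steady-state output for a unit step:", step_response[-1])
```

Because the resonators are chained, each one needs only a frequency and a bandwidth; the relative formant amplitudes follow from the all-pole product, which is why the cascade form requires fewer control parameters than the parallel form.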

The block diagram of Figure 1 depicts a simple cascade type formant synthesizer. The source
block represents the glottis during voicing and aspiration. Source parameters include
mode-of-voicing, amplitude of voicing, amplitude of aspiration, and fundamental frequency. The
blocks labelled R1 through R5 are resonators, each representing a single formant, whose combination
is the vocal-tract transfer function. A resonator is controlled by two parameters: formant frequency
and formant bandwidth.

Figure 1: Block diagram of a simple cascade formant synthesizer.


A. The Klatt Cascade-Parallel Formant Synthesizer

The Klatt formant synthesizer [7] is a hybrid formant synthesizer that incorporates both cascade
and parallel synthesis methods. It is designed with parameters to control voice and noise source
characteristics as well as filter parameters. It is capable of generating mixed modes of voicing
and modeling source-tract interactions. It includes three different voice source models and provides
control of the phenomena of diplophonia, F0 jitter, and tracheal coupling. In addition to the cascade
synthesizer, a separate parallel branch exists for the generation of frication and burst noise. Extra
pole-zero pairs are provided to model nasal and tracheal resonances. It has been demonstrated to
produce natural and intelligible synthesized speech and is sufficient to synthesize all of the sounds
of English.


We are using a recent version of the Klatt synthesizer that uses the KLSYN88 cascade-parallel
formant synthesizer first described in Klatt and Klatt [8]. Since we will be using the aspiration noise
source exclusively for synthesis of whispers, some more detail of its generation is necessary. The
Klatt synthesizer produces aspiration noise using a random number generator. The simple generator
provides a flat amplitude distribution between limits of -32768 and +32767. Note that this is a very
basic random number generator whose output is not truly Gaussian, although Klatt claims that the
spectral differences are barely perceptible. The random numbers are filtered through two zeros at
z=-0.75 and z=1. These provide soft high-pass and low-pass filtering of the number stream. This
stream is then amplitude adjusted to produce the final input to the resonators. The gain controlling
synthesis parameter AH, "amplitude of aspiration", is in decibels.
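
The noise source described above can be sketched as follows. The two zeros give the FIR filter (1 + 0.75 z^-1)(1 - z^-1) = 1 - 0.25 z^-1 - 0.75 z^-2. The dB-to-linear conversion shown for AH is an assumption made for illustration, since the synthesizer's actual amplitude scaling is not given here, and Python's random module stands in for the synthesizer's own generator.

```python
import random

def shape(samples):
    """FIR filter with zeros at z = -0.75 and z = 1."""
    x1 = x2 = 0.0
    out = []
    for x in samples:
        out.append(x - 0.25 * x1 - 0.75 * x2)
        x2, x1 = x1, x
    return out

def aspiration_noise(n_samples, ah_db, seed=0):
    """Uniform random integers, spectrally shaped, then scaled by AH."""
    rng = random.Random(seed)
    raw = [float(rng.randint(-32768, 32767)) for _ in range(n_samples)]
    gain = 10.0 ** (ah_db / 20.0)  # assumed dB-to-linear mapping
    return [gain * y for y in shape(raw)]

noise = aspiration_noise(1000, ah_db=-20.0)
print("first samples:", [round(v, 1) for v in noise[:3]])
```

The zero at z = 1 removes any DC component from the number stream, while the zero at z = -0.75 attenuates, but does not remove, energy near half the sampling rate.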

IV. A FRAME-BASED ANALYSIS/RESYNTHESIS SYSTEM

In  this  section  we  describe  the  design  of  a  speech  analysis/resynthesis  system  for  whispered 
speech.  The  routine  analyzes  whispered  speech,  and  generates  parameters  to  drive  the  Klatt  formant 
synthesizer  to  reproduce  the  input  utterance.  The  system  is  frame  based  (i.e.  each  frame  is  analyzed 
and  synthesized  independent  of  adjacent  frames)  and  consists  of  three  stages:  a  speech  analysis  (data 
collection)  stage,  a  translation/encoding  stage,  and  a  synthesis  stage. 

The initial data collection stage employs a formant tracking algorithm to determine formant
center frequencies and bandwidths for every frame. The tracking algorithm used is related to the one
proposed by Secrest and Doddington [9]. The algorithm uses linear prediction analysis to generate
formant frequency candidates and applies dynamic programming to smooth formant trajectory
estimates by imposing frequency continuity constraints. For each frame, rms energy is also calculated.

The second stage generates the parameters necessary to drive the Klatt synthesizer from the
measurements and estimates of the first stage. We do not intend to modify the Klatt synthesizer (third
stage of the system) except in special cases, so it is in the second stage that the majority of processing
will occur. Since we will be analyzing whispered speech only, information about mode of voicing and
voicing characteristics is not necessary. We are also assuming that source/tract interactions are
negligible. Therefore only formant frequencies F1-F5, formant bandwidths B1-B5, and amplitude
of aspiration AH values must be generated by stage two. The update interval is chosen to be 5
msec, so synthesizer parameter values must be produced every 5 milliseconds. Window width, the
data window over which estimates will be calculated, is independent of the update interval. Window
width is generally larger than the update interval. Although the appropriate window width is
debatable, a default width of 50 msec is used.
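
The relationship between update interval and window width can be illustrated with a minimal sketch that computes one rms value per 5 msec update from a 50 msec window. The function name and the choice to centre each window on the update instant (clipping at the signal edges) are assumptions made for the example, not details taken from the system described here.

```python
import math

def frame_rms(signal, fs, update_ms=5.0, window_ms=50.0):
    """Return one rms value per update interval.

    Each value is computed over a window centred on the update instant
    and clipped at the signal edges (an assumed convention).
    """
    hop = int(round(fs * update_ms / 1000.0))          # samples between updates
    half = int(round(fs * window_ms / 1000.0)) // 2    # half the window length
    values = []
    for centre in range(0, len(signal), hop):
        lo = max(0, centre - half)
        hi = min(len(signal), centre + half + 1)       # always includes the centre sample
        seg = signal[lo:hi]
        values.append(math.sqrt(sum(s * s for s in seg) / len(seg)))
    return values
```

With a 10 kHz sampling rate, for example, consecutive windows of 500 samples overlap by 450 samples, which is why neighbouring parameter estimates are strongly correlated but not identical.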

A collection of VCV's and CVC's, both phonated and whispered, has been collected from a
male and a female speaker. Selected whispered tokens from this collection have been used to
evaluate the basic analysis/resynthesis system. Surprisingly, the frame by frame method of encoding
and synthesizing speech presents some difficulties in implementation. Analysis of the initial
analysis/resynthesis routine on whispered steady-state vowels produces whispered speech containing click,
burp, and gargle like transients. It appears that the inherent spectral variability of whispered speech
demands very careful attention to factors such as smoothing, window width and mean filter power.
Secondly, transients are introduced at frame boundaries by the formant synthesizer, which require
special corrective measures.

The  performance  of  this  initial  version  of  an  analysis/resynthesis  system  is  unacceptable.  The 
following  section  describes  efforts  to  improve  the  performance  of  the  system.  Some  of  the  inherent 
limitations  of  frame  based  analysis/resynthesis  are  also  discussed. 

V.  SYSTEM  EVALUATION  AND  IMPROVEMENT 

Initial tests of the frame based analysis/resynthesis system demonstrate audible transient noise
in steady-state whispered vowels. This is unacceptable whisper reproduction. Four potential sources
of these transients are investigated: inappropriate window width, abrupt amplitude variations,
synthesizer induced transients and inherent spectral variation. Each of these issues is addressed, and
some possible corrective measures are offered.


A.  Window  width 

A steady-state whispered vowel /i/ was analyzed/synthesized using a range of window widths:
25 msec, 50 msec, 100 msec, 250 msec, and 500 msec. As expected, transient activity decreases as
window size increases. With the 25, 50, and 100 msec windows, transients can still be heard.
The 250 and 500 msec windows produce rather steady sounding synthesized speech. Visually, the
synthesized versions, even the large windowed ones, do not appear as steady-state as the original.

In wide-band spectrograms of the original and 5 synthesized whispered vowels, transients manifest
themselves  as  vertical  lines  most  visible  at  high  frequencies.  This  demonstrates  the  impulsive  nature 
of  these  transients.  Overall,  there  is  significantly  more  spectral  energy  in  the  original  whispered 
vowel  above  4000Hz.  The  synthesized  versions’  lack  of  high  frequency  energy  demonstrates  some  of 
the  deficiencies  of  using  the  cascade  synthesizer  with  a  limited,  fixed  number  of  formants. 

Certainly,  having  a  larger  window  width  reduces  the  variability  of  the  pole  estimates,  but  window 
widths  larger  than  100msec  are  clearly  not  practical  for  the  analysis  of  continuous  speech.  As  system 
behavior  is  studied  for  consonants,  it  may  become  necessary  to  incorporate  a  dynamic  window  width 
based  on  the  rate  of  spectral  change. 

B.  Amplitude  Variations 

It  is  possible  that  some  of  the  transients  result  from  abrupt  changes  in  the  amplitude  parameter 
AH.  To  test  this,  a  steady-state  whispered  vowel  /i/  was  resynthesized  normally  and  with  AH  =  60 
for  all  time.  The  constant  amplitude  version  is  just  as  inconsistent  as  the  old.  There  is  very  little 
audible  difference  between  the  two.  There  is  a  very  slight  difference  noticeable  if  the  window  width 
is  set  to  100msec  or  larger.  The  pops  still  exist,  but  the  weaker  ones  are  less  noticeable. 

There is another, more subtle way in which amplitude variations could be introducing audible
transients. Filter parameters specified in terms of formant frequencies and bandwidths equivalently
specify pole locations in an all-pole transfer function. No amplitude information is included. The
mean power of the all-pole filter is a function of the location of the poles. As a result, a constant
amplitude source driving two different all-pole filters will produce outputs with different average
powers. Amplitude control independent of filter function is desirable; therefore, the filters should be
normalized with respect to average power.

Unfortunately, there is no simple expression for mean power in terms of linear prediction
coefficients. A numerical evaluation technique is used instead. An FFT is used to calculate the power
spectral density at 512 points. The sum of these points produces a rectangular rule approximation
to the total filter power. The calculated average filter power is used to normalize the all-pole filter
power to 1.0. As a result, the synthesizer amplitude control is independent of the filter function.¹
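
The normalization step can be sketched as follows. This is an illustrative reconstruction rather than the code used in the system: the function names are hypothetical, the pole radius and angle use the standard mapping from formant frequency and bandwidth, and the spectrum is evaluated directly at 512 equally spaced frequencies instead of with an FFT (the two give the same samples).

```python
import cmath
import math

def formants_to_lpc(formants, fs):
    """Denominator coefficients of an all-pole filter built from
    (frequency_hz, bandwidth_hz) pairs, one conjugate pole pair each."""
    a = [1.0]
    for f, bw in formants:
        r = math.exp(-math.pi * bw / fs)     # pole radius from bandwidth
        theta = 2.0 * math.pi * f / fs       # pole angle from centre frequency
        section = [1.0, -2.0 * r * math.cos(theta), r * r]
        product = [0.0] * (len(a) + 2)       # polynomial multiplication
        for i, ai in enumerate(a):
            for j, sj in enumerate(section):
                product[i + j] += ai * sj
        a = product
    return a

def mean_power(a, n_points=512):
    """Rectangular-rule estimate of the mean of |H|^2 = 1/|A|^2
    over n_points equally spaced frequencies on the unit circle."""
    total = 0.0
    for k in range(n_points):
        w = 2.0 * math.pi * k / n_points
        big_a = sum(c * cmath.exp(-1j * w * n) for n, c in enumerate(a))
        total += 1.0 / abs(big_a) ** 2
    return total / n_points

def normalizing_gain(a, n_points=512):
    """Linear gain that scales the filter to unit mean power."""
    return 1.0 / math.sqrt(mean_power(a, n_points))
```

Multiplying the filter output by `normalizing_gain(a)` leaves the spectral shape untouched while removing the dependence of output level on pole position, so a separate amplitude parameter can set the level on its own.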

The application of this normalizing function offers some improvement in synthesis quality due to
amplitude smoothing. The time waveform is visually more consistent. In general, the adjustment
made little difference, but for the occasional odd filter estimate the compensation made a substantial
improvement.

C. Synthesizer induced transients

The fact that transients can be generated at frame boundaries in formant synthesizers is well
known. To reduce the effects of transients, the Klatt synthesizer has code to correct waveform
irregularities caused by sudden changes to F and BW. A correction proposed by Fujisaki and Azami [10]
is implemented in the synthesizer to alleviate the problem. Careful testing of this correction found
little to no improvement in removing transients. When the error correction routine was removed,
no audible difference could be heard. If the resonators were zero-stated at frame boundaries, the
extreme magnitude transients were removed. Transients at frame boundaries still exist, but at least
are not visible spikes in the waveform. One undesirable artifact of this method of reducing transients
is that a periodic component is audible at the frame frequency. This effect is reduced by applying
compensation only at locations where large transients are likely to occur.

¹ Ultimately, the synthesizer gain is a function of the measured rms energy of the signal, the mean filter power, and
an overall gain parameter. For a given signal with rms amplitude rms and its all-pole filter estimate with power
spectrum $|H(\omega)|^2$ and total power $g^2$,

$$\mathrm{filter\_gain}\,(\mathrm{dB}) = \mathrm{gain} - 20\log g + 20\log \mathrm{rms} \qquad (1)$$
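
A minimal sketch of zero-stating a single resonator at selected frame boundaries is given below. The second-order difference equation is the usual digital formant resonator; the class and function names, the use of one resonator instead of a cascade, and the 200 Hz jump threshold used to decide where a large transient is "likely" are all assumptions made for illustration.

```python
import math

class Resonator:
    """Second-order resonator: y[n] = a*x[n] + b*y[n-1] + c*y[n-2]."""

    def __init__(self, fs, f, bw):
        self.fs = fs
        self.y1 = self.y2 = 0.0
        self.set(f, bw)

    def set(self, f, bw):
        t = 1.0 / self.fs
        self.c = -math.exp(-2.0 * math.pi * bw * t)
        self.b = 2.0 * math.exp(-math.pi * bw * t) * math.cos(2.0 * math.pi * f * t)
        self.a = 1.0 - self.b - self.c   # unity gain at zero frequency

    def zero_state(self):
        """Clear the filter memory (the 'zero-stating' of the text)."""
        self.y1 = self.y2 = 0.0

    def step(self, x):
        y = self.a * x + self.b * self.y1 + self.c * self.y2
        self.y2, self.y1 = self.y1, y
        return y

def run_frames(frames, source, fs, samples_per_frame, reset_if_jump_hz=200.0):
    """Filter `source` frame by frame; frames is a list of (f, bw) pairs.
    The memory is cleared only when the centre frequency jumps by more
    than reset_if_jump_hz (a hypothetical criterion)."""
    f0, bw0 = frames[0]
    res = Resonator(fs, f0, bw0)
    out = []
    prev_f = f0
    for i, (f, bw) in enumerate(frames):
        if abs(f - prev_f) > reset_if_jump_hz:
            res.zero_state()
        res.set(f, bw)
        start = i * samples_per_frame
        for x in source[start:start + samples_per_frame]:
            out.append(res.step(x))
        prev_f = f
    return out
```

Resetting only on large parameter jumps, rather than at every boundary, is one way to avoid introducing a periodic component at the frame rate.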

D.  Spectral  Variation 

The formant tracking algorithm is one unavoidable source of errors in the analysis stage. Many
formant tracking algorithms exist, each performing best in different situations. Regardless of the
algorithm, errors will be present. All that can be done is to detect and correct the obvious errors
when they occur. The formant tracking algorithm presently in use is the one proposed by Secrest and
Doddington [9]. Occasionally, this algorithm will incorrectly produce two nearly coincident formant
tracks, resulting in an abnormally high amount of resonance at a given frequency. The best solution is for
the formant tracking algorithm not to produce such estimates. A suboptimal solution is to watch for
coincident formant tracks and substitute an adjacent frame estimate in place of the aberrant frame
estimate.
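
The suboptimal repair described above might look like the following sketch. The function name, the 100 Hz minimum separation, and the preference for the nearest earlier acceptable frame are illustrative assumptions; the text does not specify how coincidence is detected or which adjacent frame is substituted.

```python
def repair_coincident(frames, min_sep_hz=100.0):
    """frames: list of per-frame formant frequency lists, each ascending.

    Returns a copy in which any frame containing two formants closer than
    min_sep_hz is replaced by the nearest earlier acceptable frame, or by
    the first acceptable frame if none precedes it.
    """
    def is_bad(frame):
        return any(hi - lo < min_sep_hz for lo, hi in zip(frame, frame[1:]))

    good = [i for i, frame in enumerate(frames) if not is_bad(frame)]
    fixed = []
    for i, frame in enumerate(frames):
        if not is_bad(frame) or not good:
            fixed.append(list(frame))        # keep as is (or nothing to copy from)
            continue
        earlier = [j for j in good if j < i]
        source = earlier[-1] if earlier else good[0]
        fixed.append(list(frames[source]))
    return fixed
```

In practice the corresponding bandwidths would be copied along with the frequencies so that each substituted frame remains a consistent parameter set.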

Even good formant track estimates contain some variability in pole location estimates.
Unfortunately, this variability is audible and very noticeable in steady-state vowels. To qualitatively
determine the extent of variation in pole estimates, random parameter slices for /i/ and /a/ analyzed
with a 25 msec window were used to synthesize non-varying whisper tokens. Although each token is a
reasonable reproduction of the whispered vowel, there are substantial differences in whisper quality
between the different slices. This is most likely due to variability in the bandwidth estimates. Vowel
quality remains relatively consistent except for a few cases in which there is a large deviation in
formant frequency values. To further investigate the effects of spectral variability, the above
waveforms were concatenated together and played, simulating changes in synthesizer parameters across
frame boundaries without potential filter transients. Admittedly the abrupt boundaries may have
added some noise, but hopefully not too much. The files sound much like the original synthesized
tokens, full of burps and clicks. The conclusion is rather clear: significant variability in the spectral
estimates (formant parameter selections) makes a major contribution to the audible transients.

Figure 2: LPC spectrum for 50 msec frames of single steady-state vowel.

To illustrate spectral variability in formant estimates, the formant tracker derived LPC spectra
of 50 msec frames of a steady-state vowel are displayed in Figure 2. The general spectral shape is
maintained in every trace, but slight variations in formant center frequency and bandwidth alter the
entire spectrum. As observed earlier, the overall quality and timbre of each spectrum is different,
and it is not surprising that all of these sounds jumbled together sound poor.
VI. CONCLUSIONS


Whispered speech is an often overlooked mode of communication with properties that make it
attractive for experiments in speech synthesis. The properties of whispered speech enable the study
of vocal-tract dynamics independent of source-tract interactions. Also, by comparing the synthesis
of whispered speech to that of phonated speech, some effects of source-tract interactions can be
illustrated. A review of previous whisper research shows substantial study of the perception and
acoustics of whispered speech but surprisingly little research on the synthesis of whispered speech.
We believe that the study of whisper synthesis will provide insights into the synthesis of speech in
general.

To commence our study, an analysis/resynthesis system for whispered speech is constructed. This
document primarily describes the development of that system. The analysis/resynthesis
system encodes real whispered speech in terms of parameters that drive a speech synthesizer. Ideally,
the speech synthesized from the encoded parameters should sound identical to the original utterance.
In practice, this is not always possible.

Frame based analysis and synthesis has proven to be a more difficult task than expected, for both
voiced and whispered speech. Initial attempts were disappointing. Synthesized steady-state
whispered vowels sound gargly and full of transients. The sources of much of the transient behavior have
been identified: frame boundary transients, poor formant tracking, discrete amplitude variations,
and the inherent variability of spectral estimates. Careful tuning of the algorithm that interfaces
a formant tracking routine to the Klatt formant synthesizer results in a fair analysis/resynthesis
system.

The ultimate difficulty lies in the spectral variability of the LPC estimates. Each estimate by itself
produces good sounding, intelligible synthetic sounds, but placing a series of good but different
estimates next to each other produces poor sounds. An improved formant tracking algorithm would
prevent many of these problems. In addition, better use of formant tracking results could also
improve synthesis while relaxing the performance requirements demanded of the tracking algorithm.

The analysis/resynthesis system will be used as a tool in the study of whispered speech. Since
the system does not perform as well as expected, truly automatic analysis/resynthesis will not be
attempted. Whisper synthesis will ultimately require some hand tuning of the analysis/resynthesis
output.
Abstract


Intelligent  tutors  have  the  potential  to  enhance  training  in  avionics  troubleshooting  by  giving 
students  more  experience  with  specific  problems.  Part  of  their  success  will  be  associated  with  their 
ability  to  assess  and  diagnose  the  students'  knowledge  in  order  to  direct  pedagogical  interventions. 
The  goal  of  the  research  program  described  here  is  to  develop  a  methodology  for  assessment  and 
diagnosis  of  student  knowledge  of  fault  diagnosis  in  complex  systems.  Along  with  this  broad 
goal,  the  methodology  should:  (1)  target  system  knowledge,  (2)  provide  rich  representations  of 
this  knowledge  useful  for  diagnosis,  (3)  be  appropriate  for  real-world  complex  domains  like 
avionics  troubleshooting,  and  (4)  enable  assessment  and  diagnosis  to  be  carried  out  on-line.  In 
order  to  meet  these  requirements  a  general  plan  for  mapping  student  actions  onto  system 
knowledge  is  proposed  and  research  from  one  part  of  this  plan  is  presented.  Results  from  a 
Pathfinder  analysis  on  action  sequences  indicate  that  action  patterns  can  be  meaningfully 
distinguished  for  high  and  low  performers  and  that  the  patterns  reveal  specific  targets  for 
intervention. Short- and long-term contributions of this work are also discussed.




SEE  VOLUME  2.  REPORT  NO.  6, 

FOR  COMPLETE  TEXT  OF  THIS  REPORT. 


Development of a Research Paradigm to Study
Collaboration in Multidisciplinary Design Teams


Maryalice Citera
Assistant Professor
Department of Psychology
Wright State University


Jonathan A. Selvaraj
Graduate Research Assistant
Department of Psychology
Wright State University


Final  Report  for 

AFOSR  Summer  Research  Program 
Armstrong  Laboratory 


Sponsored  by: 

Air Force Office of Scientific Research
Bolling  Air  Force  Base,  Washington,  D.  C. 


September  1992 


Development  of  a  Research  Paradigm  to  Study 
Collaboration in Multidisciplinary Design Teams

Maryalice Citera
Assistant  Professor 
Department  of  Psychology 
Wright  State  University 

Jonathan  A.  Selvaraj 
Graduate  Research  Assistant 
Department  of  Psychology 
Wright  State  University 

Abstract 

The purpose of this research project was to develop a research paradigm to
investigate the collaborative process of design in multidisciplinary teams. Two phases of task
development are described. The initial phase involved identifying a design problem that
could be used to create the experimental task. The problem selected was the design of a
navigation system for an automobile. The second phase involved collecting knowledge about
the problem to make the task as realistic and interesting as possible. Knowledge was
collected from design experts using a concept mapping technique. The results highlighted
many tradeoffs and design issues that could be integrated into an experimental task. Future
steps necessary for producing the design paradigm are described.
Norman, Oklahoma 73019

Abstract 

The  National  Aeronautics  and  Space  Administration  (NASA)  is  currently  conducting  a 
series  of  space  shuttle  launches  to  enable  scientists  to  study  the  effects  of  microgravity  on  a 
variety  of  factors.  Included  in  the  second  International  Microgravity  Laboratory  mission  (IML- 
2)  will  be  an  extensive  study  of  the  effects  of  microgravity  on  astronaut  cognitive  performance 
ability.  The  Sustained  Operations  Branch  of  the  USAF  Armstrong  Laboratory  (AL/CFTO)  has 
primary  responsibility  for  this  effort.  This  large  collaborative  study  will  include  the  training 
and  testing  of  astronauts  on  a  battery  of  human  cognitive  performance  tests  prior  to  launch, 
periodically  during  the  space  mission,  and  after  the  flight. 

To  permit  an  accurate  identification  of  performance  decrements  caused  by  microgravity 
in  space,  it  is  essential  to  collect  stable  pre-flight  baseline  data.  A  preliminary  investigation  was 
conducted  to  determine  the  impact  on  baseline  performance  stability  of  less  than  optimal 
practice  schedules  and  testing  lapses  due  to  such  factors  as  launch  delays. 

A  total  of  21  subjects  were  trained  on  the  NASA  Performance  Assessment  Battery  and 
then  assigned  to  one  of  five  practice  schedules.  Two  groups  practiced  each  day  for  15 
consecutive  days.  Two  other  groups  followed  a  schedule  of  5  days  testing,  2  days  off,  5  days 
testing,  3  days  off,  5  days  testing.  The  fifth  group  followed  a  schedule  of  2  days  testing,  5 
days  off,  2  days  testing,  5  days  off,  2  days  testing.  Then,  either  three  days  or  five  days  after 
the  last  practice  session,  subjects  returned  for  five  days  of  retesting  to  represent  mission  days. 

The  study  confirmed  the  overriding  importance  of  providing  an  adequate  number  of 
practice  sessions  to  achieve  performance  stability.  By  comparison,  occasional  missed  sessions 
(i.e.,  the  5-on,  2-off  schedules)  had  little  impact  on  ultimate  performance  at  the  end  of  practice. 
The  data  indicated  a  possible  performance  difference  between  those  subjects  with  only  a  3-day 
gap  between  practice  and  "mission  days"  vs.  those  with  a  5-day  gap.  High  levels  of 
differential  stability  and  reliability  were  observed  for  at  least  one  measure  on  all  tests  but  the 
Critical  Tracking  test.  Excellent  software  reliability  was  demonstrated  by  less  than  0.02% 
missed  data  collection  points. 
PC Based Cardiovascular Model for Displaying Acceleration Stress


Frank  C.  Smeeks 
Medical  Student 
Meharry Medical College


Abstract 

Acceleration  on  the  human  body  is  known  to  cause  several  physiological  effects. 

A  PC  based  interactive  computer  model  of  the  cardiovascular  system  was  developed  for 
studying  the  system  response  to  acceleration  stress.  The  PC  model  can  be  used  in 
educating  pilots  of  high  performance  aircraft  and  visualizing  acceleration  stress  tests 
without  physically  performing  them.  The  model  consists  of  simulations  of  the  heart, 
arterial  vasculature,  venous  system,  and  peripheral  circulation.  The  calculations  for  each 
component  of  the  model  include  the  effects  of  gravitational  acceleration  and  compensatory 
mechanisms (both physiological and externally applied). Using this model, pressures and
flows at different points of the cardiovascular system can be calculated and displayed. By
observing pressures and flows at different points of interest (e.g. the ophthalmic artery
pressure), protective techniques can be investigated. The observations allow pilots and
scientists to experiment on optimizing these protective procedures without performing
unnecessary testing. Thus, the model saves undue stress on pilots and experimental costs.
The  model  is  useful  in  the  identification  of  pilot  dysfunction  and  in  the  development  of 
new  protective  maneuvers  in  response  to  acceleration  stress  on  the  cardiovascular  system. 
The  availability  and  affordability  of  the  PC  make  this  model  scientifically  and  educationally 
indispensable. 
Title


PC  Based  Cardiovascular  Model  for  Displaying  Acceleration  Stress 


Author: Frank C. Smeeks, Meharry Medical College

Introduction

Under most conditions the acceleration due to gravity, g, is constant. In aviation,
however, this is not necessarily the case. During aviation maneuvers, the human body may
be  exposed  to  additional  accelerations  which  may  have  dramatic  physiological 
consequences.  United  States  Air  Force  pilots  of  high  performance  aircraft  are  exposed  to 
acceleration  forces  exceeding  their  physiological  tolerance  limit.  At  high  acceleration  g 
values,  the  pressure  due  to  the  column  of  blood  between  the  heart  and  brain  may  exceed 
the  pressure  generated  by  the  heart.  The  cerebral  blood  vessels  collapse  and  the  pilot 
suffers  'blackout'  due  to  impaired  retinal  circulation. 
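
The hydrostatic argument above can be made concrete with a short worked calculation. The blood density and the 0.30 m heart-to-eye distance are round textbook-style values assumed for illustration; they are not parameters taken from the model described in this report.

```python
RHO_BLOOD = 1060.0       # kg/m^3, assumed typical density of blood
G0 = 9.80665             # m/s^2, standard gravity
PA_PER_MMHG = 133.322    # pascals per mmHg

def column_pressure_mmhg(g_load, height_m=0.30):
    """Hydrostatic pressure (mmHg) of a blood column of the given height
    under an acceleration of g_load times standard gravity."""
    return RHO_BLOOD * G0 * g_load * height_m / PA_PER_MMHG

if __name__ == "__main__":
    for g_load in (1.0, 3.0, 4.5):
        print(g_load, round(column_pressure_mmhg(g_load), 1))
```

Under these assumptions the column costs roughly 23 mmHg per g, so at several g the hydrostatic loss becomes comparable to the pressure the heart generates, which is the situation the paragraph describes.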

The cardiovascular system is suited to modeling attempts because of its
dependence on well understood hydraulic principles and because of the wealth of
understanding about the various constituent organs involved in blood transport. A PC
based compartmental model which models and displays the pressures in the human
cardiovascular system was developed for multiple purposes. The model serves the
function of educating pilots by visual display of physiologic phenomena associated with
aerospace maneuvers, allows scientists to visualize the results of possible experiments
without actually performing the test, and connects the world of cardiovascular modeling
to the PC domain.

Description  of  the  Model 

The cardiovascular system is a distributed system composed of blood with varying
nonlinear viscous and inertial properties interacting with vessels of complex morphology,
nonlinear elastic walls, and multiple levels of homeostatic systems controlling many
distributed properties. Material properties of components of vascular walls, blood flow in
branching tubes, distributed impedance of arterial and pulmonary trees, the
force-length-velocity relationship in the heart, and hemodynamic control of the system are some of the
components which must be considered. Guyton et al. developed refined computer models
of the cardiovascular system over longer time scales, and have built an incredibly complex
model which embodies, in over 350 equations, the behavior of interacting cardiovascular,
renal, and respiratory systems (Guyton 1972). The use of linear approximations and
lumped parameters allows a much simplified description in terms of linear ordinary
differential equations. Such a model was converted from a text based PC program to a
user friendly interactive data display model (Moore and Jaron 1991).

The model consists of simulations of the heart, arterial vasculature, venous system,
and peripheral circulation. A block diagram of the model is given in Figure 1. The model
included compensatory mechanisms which pilots use to offset the effect of acceleration
stress on the cardiovascular system (externally applied pressure, muscular straining, and
positive pressure breathing). Physiological mechanisms including carotid baroreceptor
control of the heart rate and venous tone were also included in the model. The ventricular
model is based on the 1974 Suga and Sagawa variable elastance model. The arterial
system and venous system models consisted of segments, each with its own resistance,
compliance, and inertance characteristics. The parameters for both were obtained using
calculations based on the Navier-Stokes equations. The peripheral circulation connecting
arterial and venous segments is modeled by lumped representations. The compensatory
mechanisms were modeled by adding a pressure component to apply the appropriate
external pressure to a vessel segment through its compliance. Physiologic mechanisms
were simulated by an adaptation of a 1973 model by Green and Miller.
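
To show what a single lumped segment with resistance, compliance, and inertance looks like as ordinary differential equations, the following generic sketch integrates one segment with forward Euler. It is not the model described here: the function name, the arbitrary consistent units, the downstream load `r_out`, and the way external pressure enters through the compliance (stored volume proportional to internal minus external pressure) are assumptions chosen for illustration.

```python
def simulate_segment(p_in, r, l, c, r_out, p_ext=lambda t: 0.0,
                     dt=1e-3, t_end=5.0):
    """Forward-Euler integration of one lumped vessel segment.

    State: q (flow through the segment) and v (volume stored in its compliance).
      p = v / c + p_ext(t)            pressure at the segment outlet
      l * dq/dt = p_in - p - r * q    resistance and inertance
      dv/dt = q - p / r_out           inflow minus outflow through r_out
    Returns (q, p, v) at t_end.
    """
    q = 0.0
    v = 0.0
    steps = int(round(t_end / dt))
    for n in range(steps):
        p = v / c + p_ext(n * dt)
        dq = (p_in - p - r * q) / l
        dv = q - p / r_out
        q += dt * dq
        v += dt * dv
    return q, v / c + p_ext(t_end), v
```

With a constant driving pressure the segment settles to the flow and pressure expected from the two resistances in series, while a constant external pressure lowers only the stored volume. A full model would chain many such segments and let the external pressure vary with the maneuver.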

This model, although straightforward for a scientist or computer programmer to
use, was ineffective for an end user. The shortcomings of the program were the inability
to visualize the data and the inability to choose data to report without program
modification. The existing interface was a primitive menu driven interface with a relatively
poor level of interaction. These shortcomings were overcome by creating a user friendly
interactive interface between the mathematical component and the end user. The user
interface consisted of a compartmental model which was displayed, a maneuver choice
display, and a simulated chart recorder display. All selection input was controlled by point
and click mouse function in a graphical environment and user input was accepted via
keyboard in a graphic environment.

Results 

In order for the model to be useful for teaching, it must be sophisticated enough to
embody behavior being taught, paint a valid picture of the real world, and allow changes
to system parameters. The revised PC based model provided a pseudo real time
simulation with an intuitive interface with a "virtual patient" appearance. The user
interface served the purposes of hiding implementation details, allowing students to
manipulate the system to any conceivable hemodynamic pathology, structuring the model
in an intuitive manner, reflecting the dynamic nature of the system, and maximizing
flexibility by allowing students to monitor any hemodynamic variable of interest. Several
user interface objects were developed to satisfy these purposes. The window provides a
view of some data object inside the computer (e.g. the compartmental model). An icon
provides a reminder to the user (e.g. the F-16 aircraft). The dialog box receives input from
the user (e.g. the main menu section).

High performance aircraft can withstand maneuvers that the human body is
incapable of withstanding. Using the interactive PC based model the pilot can enter
aircraft g accelerations and monitor the human physiology response. Pilots visualize on
screen via the strip chart recorder what is occurring to the human body during the
requested high performance aircraft maneuvers and compensatory mechanisms. This can
help pilots in the decision of which compensatory mechanisms to employ or if no
compensatory mechanisms would be of use. The dialog between the student (pilot) and
the computer via the interface satisfies the educational objective.

With respect to research, studies of important hemodynamic events under
conditions of acceleration stress are not easily accomplished. Invasive measurements in
human subjects are often not possible and no suitable animal models exist. Acceleration
studies require a controlled acceleration source and few human centrifuges exist. Most
human experiments rely on indirect and subjective measurements yielding unreliable
experimental results. The PC based mathematical model provides insight into the response
of the cardiovascular system to acceleration stress. By observing the results, the model
can provide insight into the causes of pilot impairment. This can lead to the postulation of
solutions to pilot impairment. Protection techniques (effectual and ineffectual) can be
modeled and applied to the subject without the requirement of human experimentation.
Those techniques which prove mathematically effectual can then be tested using the
human centrifuge. The use of a computer model to screen techniques saves undue human
experimentation and cost. Cost savings may be realized by saving one high performance
aircraft from loss due to undetected pilot impairment during a centrifuge experiment. In
addition to these savings, there is the possibility of designing and optimizing new and existing
techniques. The following paragraphs demonstrate some sample model results.

Figure 2 shows the model results (display) for a rapid onset run (ROR). This refers
to a Haversian shaped increase in g from 1.0 to a plateau value of 4.5, a time at the
plateau, and then a Haversian decline to 1.0 g. The first symptom of falling arterial
pressure to the head is impairment of vision. Loss of peripheral vision occurs at 50 mmHg
systole and complete loss of vision near 20 mmHg. The simulation indicates that the
subject would experience 'blackout' and not recover until the acceleration stress is
removed.

Figure 3. Model output under acceleration stress profile having a maximum of 4.5 g and
standard G-suit applied at 1.5 g.

Figure 3 shows the model display for the same 4.5 g Haversian profile with a
standard G-suit as the employed compensatory maneuver. The prime mechanism of the
standard G-suit is to apply 1.5 psi to the calves, thighs, and abdomen, thereby
maintaining a higher venous return. The model predicts that by wearing the standard
G-suit only a loss of peripheral vision results as opposed to complete loss of vision
without the suit. Further experimentation with a pulsating G-suit with the 4.5 g Haversian
profile shows no impairment of vision. Experimental studies on the human centrifuge
confirm all of the model findings.
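The acceleration profile and the vision thresholds described above can be sketched in a few lines of Python. This is only an illustration, not the model's own code: the onset, plateau, and offset durations, the function names, and the treatment of "near 20 mmHg" as a hard cutoff are all assumptions. Only the 1.0 g baseline, the 4.5 g plateau, and the 50 mmHg and 20 mmHg thresholds come from the text.

```python
import math

def haversine_ramp(t, duration):
    """Smooth 0-to-1 ramp, 0.5 * (1 - cos(pi * t / duration)), clamped to [0, 1]."""
    if t <= 0.0:
        return 0.0
    if t >= duration:
        return 1.0
    return 0.5 * (1.0 - math.cos(math.pi * t / duration))

def g_profile(t, g_base=1.0, g_peak=4.5, onset=3.0, plateau=15.0, offset=3.0):
    """Acceleration in g at time t (seconds).

    The onset, plateau, and offset durations are assumed values chosen
    for illustration; the report does not state them.
    """
    if t < onset:
        frac = haversine_ramp(t, onset)
    elif t < onset + plateau:
        frac = 1.0
    else:
        frac = 1.0 - haversine_ramp(t - onset - plateau, offset)
    return g_base + (g_peak - g_base) * frac

def vision_state(systolic_mmHg):
    """Classify vision from head-level systolic pressure (thresholds from the text).

    Treating 20 mmHg as a sharp cutoff is a simplification of "near 20 mmHg".
    """
    if systolic_mmHg <= 20.0:
        return "complete loss of vision"
    if systolic_mmHg <= 50.0:
        return "loss of peripheral vision"
    return "normal vision"
```

Sampling `g_profile` at a fixed time step gives a forcing function of the kind a cardiovascular model would take as input, and `vision_state` can be applied to the simulated head-level systolic pressure at each step.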

Conclusions 

Experimental results suggest that the PC based model is a useful tool for studying
the cardiovascular system responses to acceleration stress. The complexity of the
components mentioned earlier prevents an exact replica of the cardiovascular system;
however, the model is valuable in the identification of sources of pilot dysfunction and
as a guide to methods of improving tolerance to gravitational stress. The modular PC
program can easily be modified to incorporate and test new methods. The availability and
affordability of the PC to physiologists in the lab and to pilots in the classroom make
the PC based model indispensable.

Future  Directions 

Future directions for the program include the incorporation of alternate
profiles (simulated air combat maneuvers), heart rate control, addition of a pulmonary
system model, display of flow data concurrent with pressure data, and manipulation of
parameters from the menu for particular systems compartments (resistances and
capacitances).
Abstract

Twelve  food-  and  water-deprived  rats  chose  between  two  levers.  A  multiple 
fixed-ratio  49  fixed-ratio  1  schedule  was  associated  with  one  lever  and  a 
multiple  fixed-ratio  25  fixed-ratio  25  was  associated  with  the  other.  In  Phase 
1  for  both  levers,  one  of  the  two  components  defining  the  multiple  schedule 
delivered access to 0.1 cc of water while the other component delivered a
single  45-mg  food  pellet.  The  order  of  food  and  water  presentations  was 
counterbalanced  across  subjects.  To  prevent  absolute  preference  for  an 
alternative  from  developing,  the  values  of  each  multiple  schedule  were 
adjusted  according  to  a  titration  schedule:  If  a  multiple  schedule  was  selected 
four  times  in  succession,  its  ratio  values  were  incremented.  In  Phase  2,  half 
the  rats  were  exposed  to  only  water  reinforcement,  while  the  other  half 
received  only  food  reinforcement.  In  all  other  ways,  the  experiment  was 
unchanged from Phase 1 conditions. There was no reliable change in
preference,  an  outcome  incompatible  with  the  economic  notion  that  organisms 
prefer  mixtures  of  goods  over  unitary  presentations  of  a  good. 
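The titration rule stated in the abstract can be sketched as follows. This is a hypothetical illustration only: the abstract does not give the size of the increment, whether both components of a multiple schedule rise by the same amount, or whether the selection count restarts after an increment. Those choices, and all names below, are assumptions.

```python
def titrate(choices, start_ratios, step=1, run_length=4):
    """Apply a titration rule to a sequence of lever choices.

    choices      -- iterable of lever labels, e.g. "A" or "B"
    start_ratios -- dict mapping lever -> list of fixed-ratio values
                    for the components of its multiple schedule
    step         -- ratio increment (ASSUMED; not given in the abstract)
    run_length   -- consecutive selections that trigger an increment
                    (four, per the abstract)
    """
    ratios = {lever: list(values) for lever, values in start_ratios.items()}
    last, run = None, 0
    for lever in choices:
        run = run + 1 if lever == last else 1
        last = lever
        if run == run_length:
            # Raise every component ratio of the repeatedly chosen schedule.
            ratios[lever] = [value + step for value in ratios[lever]]
            run = 0  # assumed: the count restarts after each increment
    return ratios
```

For example, `titrate("AAAA", {"A": [49, 1], "B": [25, 25]})` raises lever A's schedule to fixed-ratio 50 and fixed-ratio 2 and leaves lever B unchanged, which makes A slightly less attractive and so works against an absolute preference.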
Abstract

The relationship between female and male situational awareness (SA) was
studied. The test battery administered comprised the following: Armed
Services Vocational Aptitude Battery (ASVAB), Space Fortress (SF), and Hartman
Test (HT). Male Ss scored slightly higher than female Ss on the majority of
testing; however, analysis measuring the significance of this was incomplete.
Correlations that were significant were also quite low, i.e., .1 to .4, suggesting
the relationship was weak. While scores varied widely among both groups
of Ss, further analysis is required to determine the potential for future
pilot selection through such test tools.
SITUATIONAL AWARENESS CORRELATES
(A PILOT STUDY)

Lorraine C. Williams

Cognitive psychologists have long been interested in the concepts and
complex workings of attention and memory. Opposing theories regarding resources,
mechanisms, processes, automaticity, workload and other issues remain
unresolved, and as such require further study.

Aviation psychology also concerns itself with the above disputes. Of particular
interest is how these things relate to existing aviation problems of a
cognitive nature such as accidents due to pilot error. The approach taken so
far in the field is one that considers memory and attention as trainable, with
the objective of making performance in a specific domain automatic. Automaticity
is assumed to decrease workload, thus allowing increased resources to be
available, which is assumed to lead to an increased awareness state resulting in
improved human performance. This increased awareness state is referred to as
situational awareness (SA) and has several working definitions to date.

Situational Awareness (SA) in the aviation community can generally be
defined as "the pilot's mental model of the world around him" (Endsley,
1990:41). It has also been referred to as "the knowledge that results when
attention is allocated to a zone of interest at a level of abstraction"
(Fracker, 1988:102).

For purposes of this study, the following will be used as a working
definition: "Situational awareness is principally (though not exclusively)
cognitive, enriched by experience" (Hartman & Secrist, 1991:1084). It "embodies a
remarkable cognizance of the total combat situation, the capacity to anticipate
rather than react to change, and the ability to make valid intuitive decisions
under conditions of great time urgency and stress" (Secrist et al., 1991:1).
Situational awareness skills include the following: 1) keen sensitivity to
critical cues, 2) early acquisition of cue patterns, 3) rapid cue integration,
4) high-speed pattern processing, 5) instantaneous situation assessment, 6)
direct trend apprehension, 7) anticipatory judgment, 8) intuitive decisions, and
9) response automaticity. See Secrist et al. (1991) for details.

It has been proposed that these qualities may already be reasonably
developed in some individuals, such as those top pilots (approximately 5%) who have
historically been responsible for 40% of total kills (Youngling et al., 1977).
It is further suggested from previous research that these skills are trainable,
and will contribute significantly to a highly competent level of performance
(Hartman & Secrist; Endsley, 1987, 1988b, 1990).


THE  PROBLEM 

To  increase  the  currently  small  number  of  top  fighter  aces,  there  is  a 
need  to  make  certain  each  aviator  is  performing  at  their  full  potential.  To 
date  limited  studies  with  small  samples  have  been  done  in  developing  the  SA 
concepts  and  model  of  testing;  therefore  there  is  a  need  to  conduct  further 
studies  of  reliability  and  validity  for  existing  protocols  and  compare  them 
against  each  other.  There  are  many  approaches  to  this  problem;  I  have  chosen  to 
examine  the  feasibility  of  selecting  individuals  into  flying  training  programs 
who  have  situational  awareness  training  potential. 

METHODOLOGY 

Two computer-administered tests were used to assess SA skills. These
comprised the following:

Space Fortress (SF) - This tool was developed in the Cognitive Psychophysiology
Laboratory at the University of Illinois at Urbana-Champaign as "an
experimental task for the study of complex skill and its acquisition" (Mané &
Donchin, 1989:17). This video game is comprised of multiple demanding tasks
yielding a set of subscores as well as a total score. The goal is to keep one's
ship (controlled by a standard joystick) in control at all times while maneuvering
away from enemy fire and attempting to destroy the enemy's fortress (using a
standard keyboard). See Mané & Donchin (1989) for details.

Hartman Test (HT) - This task, which measures visual identification in
near-threshold processing, was developed by Bryce O. Hartman at the Armstrong
Laboratory Aerospace Medicine Directorate, in collaboration with Grant Secrist.
The task uses four visual targets (card suits) and a blank (no distractors) with
a mask. Each subject receives two scores: 1) reaction time and 2) accuracy.
See Secrist & Hartman (1992) for details.

Procedure - The tests, although self-administered, were monitored by
experienced technicians. Each test comes with instructions and built-in practice
sessions as part of the software.

The SF test was administered first over 45 minutes, followed by a three
minute break, followed by 22 minutes of the HT test. There was one 15 minute
break. The HT task was administered three blocks at a time with 20 trials per
block. Visual stimulus times were set at 66, 52, and 33 msec. There were
two blocks given at each stimulus time for a total of six blocks.

Data Analysis - Descriptive statistics were calculated for ASVAB, SF, and
SA scores. These data were analyzed using Pearson product-moment correlation
coefficients to determine the relationship both within and among SF, SA and
ASVAB subscores and total scores. Varied N's represent Ss whose testing was
incomplete. A corrected matrix produced with the range-x program would have
been appropriate to improve credibility of the estimates; however, this was not
possible due to time constraints.
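The correlation step can be illustrated with a short Python sketch. It computes a Pearson product-moment coefficient from whatever complete pairs of scores are available, which is one plausible reading of why the N's vary. The function name and the use of None for a missing score are assumptions, and the sketch does not attempt the range-restriction correction mentioned above.

```python
import math

def pearson_r(x, y):
    """Pearson product-moment correlation over pairwise-complete cases.

    x, y -- equal-length sequences of scores; None marks a missing score.
    Returns (r, n), where n is the number of complete pairs used.
    """
    pairs = [(a, b) for a, b in zip(x, y) if a is not None and b is not None]
    n = len(pairs)
    if n < 2:
        raise ValueError("need at least two complete pairs")
    mean_x = sum(a for a, _ in pairs) / n
    mean_y = sum(b for _, b in pairs) / n
    # Sums of cross-products and squares about the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in pairs)
    sxx = sum((a - mean_x) ** 2 for a, _ in pairs)
    syy = sum((b - mean_y) ** 2 for _, b in pairs)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for a constant variable")
    return sxy / math.sqrt(sxx * syy), n
```

Returning n alongside r keeps the effective sample size visible for each cell of a correlation matrix, which matters when subjects with incomplete testing drop out of some comparisons but not others.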
RESULTS

Demographics - Subjects were 191 male and 21 female basic AF recruits
ranging in age from 18 to 26 with a mean age of 20. Subjects had previously
been administered the Armed Services Vocational Aptitude Battery (ASVAB) test
upon entry; thus scores were obtained from data files (see Ree et al. for
details). Subjects were tested on the eleventh day of their six weeks of basic
training.

Armed Services Vocational Aptitude Battery (ASVAB) - The sum of subtest
scores, or total score, ranged from 446 to 644 for males with a mean of 550.2 and
standard deviation of 36.3. Total scores for females ranged from 465 to 618
with a mean of 536.4 and standard deviation of 43.6. Males scored slightly
higher on the subtests with the exception of three. Females scored slightly
higher than males on numerical operations (NO), 57.5 vs. 55.3; coding speed (CS),
60.4 vs. 54.7; and mathematics knowledge (MK), 58.0 vs. 56.3 (See Table 1).

Male correlations were as follows. Significant correlations existed for
general science (GS) and various subtests; however, these were quite low (r = .24
to r = .54). The correlation between GS and word knowledge (WK) was r = .57, p
< .001. Arithmetic reasoning (AR) was significantly correlated with various
subtests, ranging from r = .19 to r = .39. A significant correlation of r
= .6, p < .001 existed for AR and MK. There were no significant correlations at
r = .6 or greater for the remaining subtests (See Table 2).

Female correlations were as follows. General science (GS) was significantly
correlated with AR at r = .77, p < .001; MK at r = .65, p < .001; MC at r
= .78, p < .001; and EI at r = .63, p < .001. Other significant correlations
were quite low, as they were for AR and MC; WK and MC; WK and EI; PC with AS, MK,
MC, and EI; NO with AS and MK; AS and EI; and MK and MC. Other significant
correlations were for AR and MK, r = .80, p < .001; WK and PC, r = .70, p
< .001; WK and AS, r = .71, p < .001; and AS and MC, r = .65, p < .001. No
significant correlations existed between coding speed (CS) and any other subtest
(See Table 2).


Table 1

Means and Standard Deviations for
Males' and Females' ASVAB Scores

Source                            Male             Female

General Science (GS)              54.7 (6.8)       51.8 (7.3)
Arithmetic Reasoning (AR)         55.0 (6.2)       55.5 (6.9)
Word Knowledge (WK)               54.2 (4.6)       54.3 (6.3)
Paragraph Comprehension (PC)      55.1 (5.2)       55.1 (5.3)
Numerical Operations (NO)         55.3 (6.4)       57.5 (4.9)
Coding Speed (CS)                 54.7 (6.7)       60.4 (6.7)
Auto & Shop Info. (AS)            54.6 (7.5)       45.9 (7.0)
Mathematics Knowledge (MK)        56.3 (7.4)       58.0 (6.9)
Mechanical Comprehension (MC)     57.0 (7.1)       49.5 (8.6)
Electronics Info. (EI)            53.6 (7.5)       48.4 (6.2)
Sum of Subtests (SSS)             550.2 (36.3)     536.4 (43.6)

N = 184 males, 21 females
Note: SD in ( )
Table 2

Correlation Matrix for Males' and Females'
ASVAB Scores
Space Fortress (SF) - Block one total scores for males ranged from
-41110.00 to 6936.00 with a mean of -15824.23 and a standard deviation of
9331.58. Block two totals ranged from -37582.00 to 21988.00 with a mean of
-11719.75 and a standard deviation of 11832.86 (See Table 3).

Table 3

Means and Standard Deviations for
Males' Space Fortress Scores

Source       Block 1                  Block 2

Points       -9023.68 (5922.78)       -7150.34 (8839.17)
Velocity      1556.60 (1191.76)        1969.22 (1799.35)
Control      -1919.56 (5969.21)        -439.31 (6830.23)
Speed        -6437.59 (2347.94)       -6099.31 (3458.73)
Total       -15824.23 (9331.58)      -11719.75 (11832.86)

N = 191 Block 1, 189 Block 2
Note: SD in ( )

Female total scores for block one ranged from -34212.00 to -16611.00 with
a mean of -24168.67 and a standard deviation of 4755.76. Block two totals
ranged from -34947.00 to -4497.00 with a mean of -20631.38 and a standard
deviation of 7582.32 (See Table 4).

Females'  scores  improved  from  block  one  to  block  two  with  the  exception  of 
the  points  category  which  decreased.  Males  improved  in  all  categories  from 
block  one  to  block  two.  Males  scored  higher  overall  for  all  categories  with  the 
exception  of  speed  in  block  two  (See  Tables  3  and  4). 
Table 4

Means and Standard Deviations for
Females' Space Fortress Scores

Source       Block 1                  Block 2

Points       -9141.48 (4378.35)       -9853.62 (1914.08)
Velocity       916.33 (533.67)         1206.67 (1532.27)
Control      -8473.05 (2859.21)       -6320.14 (4978.09)
Speed        -7470.48 (1379.35)       -5664.29 (3870.34)
Total       -24168.67 (4755.76)      -20631.38 (7582.32)

N = 21
Note: SD in ( )



For males (See Table 5), block one points was negatively correlated with
block one velocity at r = -.26, p < .001, and block two speed at r = -.23, p
< .001. Block one and two points were positively correlated at r = .64, p
< .001.

Block one velocity was negatively correlated with block two points at r =
-.15, p < .05. Velocity one was positively correlated with control one and two
at r = .66, p < .0001 and r = .53, p < .0001 respectively. Velocity one was
also correlated with velocity two, r = .73, and speed two, r = .37, p < .0001.

Control one was correlated with speed one and two, r = .46, p < .0001 and r
= .29, p < .0001 respectively. Control one was also correlated with velocity
two at r = .49, p < .0001. Control one and two were correlated at r = .79, p
< .0001.

Speed one was negatively correlated with block two points at r = -.27, p
< .0001. Speed one was positively correlated with velocity two, r = .39; control
two, r = .35; and speed two, r = .55, p < .0001 for all.

Block two points were negatively correlated with velocity two at r = -.30
and speed two at -.52, p < .0001. Velocity two was positively correlated with
control two at r = .61 and speed two at r = .62, p < .0001. Control two was
correlated with speed two at .46, p < .0001.

Table 5

Correlation Matrix for Males' Space Fortress
Scores

           Pnts 1    Vel 1     Cntrl 1   Spd 1     Pnts 2    Vel 2     Cntrl 2   Spd 2

Pnts 1     1.0       -.26*     .05       -.55*     .64*      -.06      .10       -.23**
Vel 1      -.26*     1.0       .66*      .52*      -.15***   .73*      .53*      .37*
Cntrl 1    .05       .66*      1.0       .46*      .11       .49*      .79*      .29*
Spd 1      -.55*     .52*      .46*      1.0       -.27*     .39*      .35*      .55*
Pnts 2     .64*      -.15***   .11       -.27*     1.0       -.30*     -.05      -.52*
Vel 2      -.06      .73*      .49*      .39*      -.30*     1.0       .61*      .62*
Cntrl 2    .10       .53*      .79*      .35*      -.05      .61*      1.0       .46*
Spd 2      -.23**    .37*      .29*      .55*      -.52*     .62*      .46*      1.0

*   p < .0001
**  p < .001
*** p < .05

Key: Pnts: Points
     Vel: Velocity
     Cntrl: Control
     Spd: Speed
     1: Block 1
     2: Block 2

N = 2 missing for Block 2

For females (See Table 6), block one points was correlated negatively with
speed one, r = -.44, and velocity one, r = -.42, p < .05. Points one and two were
also correlated at r = .59, p < .001.
Table 6

Correlation Matrix for Females' Space Fortress
Scores

           Pnts 1    Vel 1     Cntrl 1   Spd 1     Pnts 2    Vel 2     Cntrl 2   Spd 2

Pnts 1     1.0       -.63      -.32      -.44****  .59**     -.42****  -.31      -.30
Vel 1      -.63**    1.0       .68*      .83*      -.28      .40****   .52***    .28
Cntrl 1    -.32      .68*      1.0       .75*      -.20      .31       .68*      .26
Spd 1      -.44****  .83*      .75*      1.0       -.21      .39****   .63**     .44****
Pnts 2     .59***    -.28      -.20      -.20      1.0       -.91*     -.58***   -.76*
Vel 2      -.42****  .40****   .31       .39****   -.91*     1.0       .68*      .88*
Cntrl 2    -.31      .52***    .68*      .63**     -.58***   .68*      1.0       .65**
Spd 2      -.30      .28       .26       .44****   -.76*     .88*      .65**     1.0

*    p < .0001
**   p < .001
***  p < .01
**** p < .05

Velocity one was correlated with control one at r = .68 and speed one at r
= .83, p < .0001. Velocity one was correlated with velocity two at r = .40,
p < .05, and with control two at r = .52, p < .01.

Control one was correlated with speed one at r = .75, p < .0001. Control
one and two were correlated at r = .68, p < .0001.

Speed one was correlated with speed two at r = .44, and velocity two, r
= .39, p < .05. Speed one and control two were correlated, r = .63, p < .01.

Points two was negatively correlated with velocity two at r = -.91, and
speed two at r = -.76, p < .0001. Points two and control two were negatively
correlated at r = -.58, p < .01.

Velocity two was correlated with control two at r = .68 and speed two at r
= .88, p < .0001. Control two and speed two were correlated at r = .65, p
< .001.
Hartman Task (HT) - Males scored a minimum of 0% and a maximum of 100%
correct at 66 msec for block one versus 1.6% to 100% for block two. Males
scored a minimum of 0% correct and a maximum of 100% for block one trials at 33
msec. They scored a minimum of 6.6% and a maximum of 96.7% correct for block two
trials at 33 msec.

Females scored a minimum of 5% correct at 66 msec, while their maximum
was 100% for block one. Block two yielded a 23% minimum and a 100% maximum.
Block one trials at 33 msec resulted in a 16% accuracy minimum and a 68.33%
maximum.

Females had slightly faster reaction times overall; however, male Ss had
higher accuracy rates. Both male and female Ss improved their scores from block
one trials to block two, except for males, who scored slightly worse at 33 msec
during block two trials (See Table 7).

Table 7

Means and Standard Deviations for Male
and Female Hartman Task Scores

                      Male                                  Female
              Block 1            Block 2            Block 1            Block 2

% correct
  66 msec     63.18 (21.86)      71.91 (23.36)      57.58 (20.92)      69.75 (24.32)
  52 msec     65.09 (23.52)      67.58 (26.66)      60.67 (20.94)      66.83 (21.87)
  33 msec     45.02 (22.82)      45.33 (21.54)      38.25 (13.28)      40.33 (17.56)

Reaction time
  66 msec     1017.64 (249.77)   934.43 (186.18)    979.80 (134.23)    867.00 (94.47)
  52 msec     962.20 (208.97)    944.12 (187.44)    916.85 (101.41)    916.50 (131.18)
  33 msec     941.98 (260.31)    982.61 (197.38)    898.50 (174.84)    896.20 (167.22)

N = 177 males, 21 females
Note: SD in ( )
Correlational Findings - Space Fortress and ASVAB correlations for males
were as follows (See Table 8). There was a significant correlation between GS and
control one, r = .20, p < .001. General science was correlated with velocity
two, r = .20, and control two at r = .20, p < .01. Block one points and GS were
correlated at .15, p < .05.

Table 8

Correlation Matrix for Males' ASVAB and
Space Fortress Scores

      Pnts1        Vel1         Cntrl1       Spd1         Pnts2        Vel2         Cntrl2       Spd2

GS    .15****      .10          .20**        -.02         .12          .20***       .20***       .06
AR    .09          .12****      .14****      .03          .10          .13****      .11          .09
WK    .15****      .13****      .21**        .02          .06          .21***       .20***       .18***
PC    .12****      [illegible]  .10          -.04         .10          .05          .11          -.00
NO    .01          .08          .12****      .08          .06          .00          .04          .02
CS    .08          .01          .07          .06          .06          -.04         -.04         .06
AS    -.03         .10          .17          .13****      -.01         .08          .06          .07
MK    .25*         .15****      .25*         -.06         .15****      .20***       .24**        .10
MC    .07          .09          .30*         .10          .09          .16****      .32*         .14****
EI    .10          .12          .23**        .08          .04          .16****      .23**        .19**

*    p < .0001
**   p < .001
***  p < .01
**** p < .05


Block one and two velocity scores were correlated with AR at r = .12 and r = .13
respectively, p < .05. Control one and AR were correlated at r = .14, p < .05.

Word knowledge was correlated with velocity one and two at r = .13, p
< .05, and r = .21, p < .01. Control one and two were correlated with WK at r
= .21, p < .01 and r = .20, p < .001. Block one points were correlated with WK
at r = .15, p < .05, while WK and speed two were correlated at r = .18, p < .001.

Paragraph comprehension was correlated with block one points at r = .12,
p < .05. Numerical operations was correlated with control one at r = .12, p
< .05. There were no significant correlations between coding speed and any SF
subtest scores.

Block one speed and AS were correlated at r = .13, p < .05. Mathematical
knowledge was correlated with points one and control one, both at r = .25, p
< .0001. Mathematical knowledge was correlated with control two at r = .24, p
< .001 and with velocity two at r = .20, p < .01. Block one velocity and points
two were correlated with MK, both at r = .15, p < .05.

Mechanical comprehension was correlated with control one and two at r
= .30 and r = .32, p < .0001. Velocity two was correlated with MC at .16 and
speed two and MC were correlated at r = .14, p < .05 for all.

Electronic information was correlated with control one and two, both at r
= .23, p < .001. It was also correlated with speed two at r = .19, p < .001 and
with velocity two at r = .16, p < .05.

Significant Space Fortress and ASVAB correlations for female Ss were as
follows (See Table 9). General science was correlated with points one at r =
-.39 and control one at r = .39, p < .05 for both.

Numerical operations was negatively correlated with velocity one at r =
-.41, p < .05. Coding speed was correlated with velocity two at r = .39, p
< .05.

Mathematical knowledge was correlated with velocity one and control one
at r = .39 and r = .38, p < .05. There were no significant correlations between
Space Fortress subtest scores and arithmetic reasoning, word knowledge, paragraph
comprehension, auto and shop information, mechanical comprehension, or
electronics information.


Table 9

Correlation Matrix for Females' ASVAB
and Space Fortress Scores

      Pnts1      Vel1       Cntrl1     Spd1       Pnts2      Vel2       Cntrl2     Spd2

GS    -.39****   .28        .39****    .31        -.20       .15        .15        .25
AR    -.36       .33        .36        .23        -.30       .30        .27        .29
WK    -.17       -.05       .03        .01        -.19       .08        -.04       .24
PC    .06        .01        .17        .22        .11        -.05       -.11       .18
NO    -.33       -.41****   .36        .37        -.23       .26        .36        .23
CS    -.32       .37        .16        .33        -.31       .39****    .25        .17
AS    -.27       -.07       .14        -.01       -.11       -.07       -.19       .05
MK    -.21       .39****    .38****    .37        -.18       .27        .29        .29
MC    -.18       .18        .30        .34        -.10       .16        .13        .35
EI    -.21       .09        .29        .20        -.13       -.07       -.02       .19

*    p < .0001
**   p < .001
***  p < .01
**** p < .05



No significant correlations existed between SF subscores and HT response
times for males' block one testing. Significant correlations existed between
all SF subscores and percent correct at 52 msec, ranging from .14 to .39, p
< .05 to p < .0001. Significant correlations existed at 66 msec between % correct
and SF subscores except for points one, ranging from .15 to .32 at p
< .05 to p < .0001. At 33 msec, correlations with SF subscores except for velocity
ranged from .18 to .38, p < .01 to p < .0001.

For block two, male response times were negatively correlated with SF
subscores, i.e., 66 msec with speed two at -.17, p < .05; 52 msec with points two
at -.16, p < .05; and 33 msec with speed two at -.14, p < .05. For percent
correct, significant correlations existed between 66 msec and SF subtests except
for speed two, ranging from .26 to .29 at p < .0001. Correlations between 52
msec and SF subtests except for speed two ranged from .17 to .30 at p
< .05 to p < .0001. For 33 msec, significant correlations existed with SF subtests
except for speed two, ranging from .12 to .31 at p < .05 to p < .0001.

For females, no significant correlations existed between HT and SF
subscores for block one testing. For block two, there were no significant
correlations between SF subscores and percent correct at 33 msec or response time
at 52 msec. Percent correct for HT was correlated with control two at r = .47,
p < .05 for 52 msec and r = .45, p < .05 for 66 msec.

For response time, HT and control two were correlated at r = -.44, p < .05
for 33 msec. At 66 msec, HT and SF subscores except control two were
significantly correlated, ranging from -.39 to .39, p < .05.

For male HT average scores and ASVAB scores, no significant correlations
existed between WK and HT. Existing significant correlations were low (.13
to .24). Correlations between response times and ASVAB subscores tended to be
negative as expected (See Addendum Table 10).

For females, no significant correlations existed between HT and the
following ASVAB subscores: WK, PC, NO, CS, AS, and MK. Percent correct was
correlated with GS at .56, p < .01 for 66 msec and at .49, p < .05 for 52 msec.
Percent correct was correlated with AR at .46, p < .05 for 66 msec. Percent
correct was correlated with MC at .59, p < .01 for 66 msec and at .53, p < .01
for 52 msec. Response time was correlated with EI at r = .39, p < .05 for 52
msec. There were fewer negative correlations between ASVAB subscores and HT
than for males (See Addendum Table 11).

DISCUSSION  AND  CONCLUSIONS 

In general, males scored slightly higher than females on the majority of
tests and subtests. The significance of this, however, is not yet known due to
incomplete analysis. Large standard deviations and variances represent a wide
range of scores, suggesting rather high as well as low scores for certain
individuals. This also requires further testing for differences between means.
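One way such a test for a difference between means could be set up from the published summary statistics alone is Welch's unequal-variance t statistic, which suits groups of very different size (184 males, 21 females). The sketch below is an illustration under that assumption, not an analysis performed in this study; the function name is hypothetical, and the example values are the ASVAB total score means, standard deviations, and N's from Table 1.

```python
import math

def welch_t(mean1, sd1, n1, mean2, sd2, n2):
    """Welch's t statistic and approximate degrees of freedom
    computed from group means, standard deviations, and sizes."""
    v1 = sd1 ** 2 / n1  # squared standard error of mean 1
    v2 = sd2 ** 2 / n2  # squared standard error of mean 2
    t = (mean1 - mean2) / math.sqrt(v1 + v2)
    # Welch-Satterthwaite approximation to the degrees of freedom.
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return t, df

# ASVAB sum of subtests, males vs. females (values from Table 1).
t_stat, dof = welch_t(550.2, 36.3, 184, 536.4, 43.6, 21)
```

The resulting statistic would then be compared against a t distribution with the computed degrees of freedom; the small female sample dominates the standard error, so the test has limited power to detect modest differences.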

Significant correlations which existed between tests and subtests were
rather low for the majority, i.e., .1 to .4, with very few .5 or above. These
represent weak relationships although statistically significant.
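The distinction between statistical significance and strength of relationship can be made concrete with two standard formulas: the t statistic for testing a correlation against zero, t = r * sqrt((n - 2) / (1 - r^2)), and the proportion of shared variance, r^2. The sketch below is illustrative only; the function names are hypothetical and the sample size of 191 is used simply because it is the largest male N reported.

```python
import math

def r_to_t(r, n):
    """t statistic (n - 2 degrees of freedom) for testing H0: rho = 0."""
    return r * math.sqrt((n - 2) / (1.0 - r * r))

def variance_explained(r):
    """Proportion of variance shared by the two measures (r squared)."""
    return r * r

# A correlation at the low end of the reported range, with n = 191.
t_small = r_to_t(0.15, 191)
shared_small = variance_explained(0.15)   # about 2% of the variance
shared_upper = variance_explained(0.40)   # about 16% of the variance
```

With a sample of this size, a correlation of .15 yields a t statistic a little above 2 even though the two measures share only about 2% of their variance, which is why the low coefficients above can be significant yet weak.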

As analysis remains in progress, it is difficult to come to a conclusion.
We do know, however, that these tests produce a wide variety of scores which may
prove useful in future pilot selection through further test battery refinement.
ADDENDUM
Table 10

Correlation Matrix for Males' ASVAB
and Hartman Task (HT) Scores

      %@66         RT           %@52         RT           %@33         RT

GS    .22***       -.09         .21***       -.15****     .22***       -.01
AR    .07          -.03         [illegible]  -.06         .13****      -.01
WK    .03          .05          .09          -.05         .10          -.00
PC    .09          [illegible]  .14****      -.04         .09          -.02
NO    .10          -.09         .11          -.14****     .16****      -.11
CS    .11          [illegible]  .13****      -.24**       .14****      [illegible]
AS    .09          -.13****     .03          -.05         .01          -.08
MK    .16****      -.04         .17***       -.10         .19***       -.02
MC    .24***       -.09         .20***       -.02         .23**        .06
EI    .10          -.18***      .05          -.10         .08          -.05

*    p < .0001
**   p < .001
***  p < .01
**** p < .05
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Table  11 


Correlation Matrix for Females' ASVAB
and Hartman Task (HT) Scores


        %866         RT           %@52         RT           %@33         RT

GS      .56***       -.07         .49****      .03          .26          .02
AR      .46****      .24          .37          .08          .18          .04
WK      .34          -.22         .32          -.05         .08          -.04
PC      .29          -.29         .11          -.04         -.09         -.96
NO      .27          .24          .09          .21          -.17         .02
CS      .37          .08          .24          -.01         -.04         -.01
AS      .34          -.19         .31          .11          .17          .18
MK      .34          .02          .13          -.09         -.11         -.15
MC      .59***       [illegible]  .53***       .11          .21          .09
EI      .32          .04          .22          .39****      .03          .32

*     p < .0001
**    p < .001
***   p < .01
****  p < .05
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SEE  VOLUME  2,  REPORT  NO.  25, 

FOR  COMPLETE  TEXT  OF  THIS  REPORT. 
Collaborative Instructional Development Environment:
A Stage for the AIDA


Robert G. Main, Ph.D.
and

Andrew S. Wilson, M.A.
Department of Communication Design
California State University, Chico


Computer-based media production tools have matured sufficiently to enable the Air Force to readily provide very
powerful curriculum materials development tools based on the existing Workstation III or IV. However, providing instructional
designers and developers with a multimedia development workstation is not equivalent to providing them the power to use them
well. While an Automated Instructional Design Advisor will certainly aid designers and developers in choosing appropriate
media to solve instructional problems, provision of such powerful media production tools will require a commitment within the
Air Force to provide technical and creative support. Only this will ensure effective, motivational media design.

An examination of matured computer-based media production technology was undertaken and a group of ISD experts
was impaneled to discern which available tools hold the most promise and value for instructional design. This study presents the
findings of the Delphi panel and considers the impact of providing such tools to designers and developers. We make the
recommendation that implementation of a computer-mediated communication system be concurrent with the emplacement of
computer-based media production tools to create a collaborative instructional development environment that will improve
media creativity and dynamism, especially with respect to computer-based training. In addition, such a system will provide for
centralized archiving of reusable and repurposed media, effective formative and summative evaluation, and increased collaboration
between instructional designers, developers, subject matter experts and media production experts, thus increasing instructional
quality, employee productivity and job satisfaction.
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